STATISTICAL REGULARITIES OF SELF-INTERSECTION COUNTS FOR 
GEODESICS ON NEGATIVELY CURVED SURFACES 



STEVEN P. LALLEY 



Abstract. Let S be a compact surface with constant negative curvature —1. From among 
all closed geodesies on T of length < T, choose one at random and let Nt be the number of 
its self-intersections. We prove that for a certain constant k = kt > the random variable 
(jVr — kT'^)/T has a limit distribution as T ^> cxd. We conjecture that for surfaces of variable 
negative curvature the order of magnitude of typical variations is y-^/^^ rather than T. We 
also prove analogous results for generic geodesies, that is geodesies whose initial tangent 
vectors are chosen randomly according to normalized Liouville measure. 
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1. Introduction 

1.1. Self-intersections of random geodesies. Choose a point x and a direction 6* at ran- 
dom on a compact, negatively curved surface T — that is, so that the distribution of the 
random unit vector {x,9) is the normalized Liouville measure on the unit tangent bun- 
dle ST — and let 7(t) = 7(t; x, 9) be the unit speed geodesic ray in direction 9 started at 
X, viewed as a curve in ST. Let p : ST — > T be the natural projection, and denote by 
N{t) = N{'j[0,t]) the number of transversa^ self-intersections of the geodesic segment 
p o 7[0, t]. For large t the number N{t) will be of order t^; in fact, 

(1) lim N{t)/t^ = l/(47r|T|) := kt 

with probability 1. See section |231 below for the (easy) proof. A similar result holds for a 
randomly chosen dosed geodesic [21]: if from among all closed geodesies of length < L 
one is chosen at random, then the number of self-intersections, normalized by L^, will, 
with probability approaching one as L — )- oo, be close to Ky ■ (See |l30l for a related theo- 
rem). Closed geodesies with no self-intersections have long been of interest in geometry 
— see, for instance, 16H5H25| — and it is known t 25ll33| that the number of simple closed 
geodesies of length < t grows at a polynomial rate in t. The fact that there are arbitrarily 
long simple closed geodesies implies that the maximal variation in N{t) is of order t^. The 
problems we address in this paper concern the order of magnitude of typical variations of 
the self-intersection count N{t) about Kyt"^ for both random and random closed geodesies. 
For random geodesies the main result is the following theorem. 

^If the initial point x and direction 6 are chosen randomly (according to the normalized Liouville measure 
on the unit tangent bundle) then there is probability that the resulting geodesic will be periodic, so with 
probability 1 every self-intersection will necessarily be transversal. 
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Theorem 1.1. Let T be a compact surface equipped with a Riemannian metric of negative curva- 
ture. Assume that u = (x, 6) is a random unit tangent vector distributed according to normalized 
Liouville measure on ST, and let N{T) be the number of transversal self-intersections of the geo- 
desic segment 7([0, T]; x, 6) with initial tangent vector u. Then as T — >• oo, 

(2) N{T)-.rT^ 

for some probability distribution if on M (which will in general depend on the surface and the 
Riemannian metric). 

Here — )• indicates convergence in distribution (i.e., weak convergence, cf. lH): a family of 
real-valued random variables Yf is said to converge in distribution to a Borel probability 
measure G on M if for every bounded, continuous function / 



lim Ef{Yt)= [ fdG. 



Since to first order A^(r) is approximately kmT'^, and since T = Vt^, Theorem 11.11 
might at first sight appear to be typical "central limit" behavior (see [31 J for the classical 
central limit theorem for geodesic flows). But it isn't. The limit distribution in ((2]) is a 
limit of Gaussian quadratic forms, and therefore is most likely not Gaussian. Moreover, 
a closer look will show that for central limit behavior the typical order of magnitude of 
fluctuations should be T'^/^, not T. This is what occurs for localized self-intersection counts, 
as we now explain. 

Label the points of self -intersection of 7([0, T]) on T as xi, 2:2, . . . , X7v(r) (the ordering 
is irrelevant). For any smooth, nonnegative function : T — >• M_|_ define the if— localized 
self -intersection count A'^^ (T) by 

N{T) 

(3) iv^(r) = iv^(7[o,r])= J] v'(x,). 

i=l 

Like the global self -intersection count N{T), the localized self-intersection count N^{T) 
grows quadratically in T: in particular, if the initial tangent vector {x,9) is chosen ran- 
domly according to the normalized Liouville measure then with probability one, 

1- ^^(^) II II 
lim = Kr f I 

T-j-oo I ^ 

where ||y?||i denotes the integral of Lp against normalized surface area measure on T. 

Theorem 1.2. for any compact, negatively curved surface T there is a constant e > with the 
following property. Ifip>Ois smooth and not identically but has support of diameter less than 
e, then under the hypotheses of Theorem \l.l\ for some constant cr > depending on 99, 

fl T — > 00. Here <I> is the standard unit Gaussian distribution on M. 
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In the course of the proof we will show that a lower bound for e is the distance between 
two non-intersecting closed geodesies. 

1.2. Self-intersections of closed geodesies. On any compact, negatively curved surface 
there are countably many closed geodesies, and only finitely many with length in any 
given bounded interval [0, L]. According to the celebrated "Prime Geodesic Theorem" 
of Margulis Il23| . ||24|. the number vr(L) of closed geodesies of length < L satisfies the 
asymptotic law 



where /i > is the topological entropy of the geodesic flow. Furthermore, the closed 
geodesies are equidistributed according to the maximal entropy measure, in the following 
sense Il20ll : if a closed geodesic is chosen at random from among those of length < L, then 
with probability approaching 1 as L — )• oo, the empirical distribution of the geodesic cho- 
sen will be in a weak neighborhood of the maximal entropy measure. (See also Bowen [8] 
for a somewhat weaker statement. It does not matter whether the random closed geodesic 
is chosen from the set of prime closed geodesies or the set of all closed geodesies, because 
Margulis' theorem implies that the number of non-prime closed geodesies of length < L 
is 0{e^^^'^).) In addition, the maximal entropy measure governs the statistics of closed 
geodesies even at the level of "fluctuations", in that central limit theorems analogous to 
that governing random geodesies (cf. ||3T| ) hold for randomly chosen closed geodesies - 
see [18J and [19J for precise statements. Thus, it is natural to expect that the maximum 
entropy measure also controls the statistics of self -inter sections. For compact surfaces of 
constant negative curvature the maximum entropy measure and the (normalized) Liou- 
ville measure coincide, so it is natural to expect that in this ease there should be some 
connection between the fluctuations in self-intersection count of closed geodesies with 
those of random geodesies. The next theorem asserts that this is the ease. 

Theorem 1.3. Let T he a compact surface of constant negative curvature, let jl be a closed 
geodesic randomly chosen from the 7r(L) closed geodesies of length < L, and let N{'yL) be the 
number of self-intersections ofjL- Then for some probability distribution onR,as L ^ oo. 



A similar result holds for the number of intersections of two randomly chosen closed 
geodesies. Let 7^ and 7^ be independently chosen at random from the set of closed geodesies 
of length < L, and letA^(7L,7^)be the number of intersections of 7l with 7^ . (If by chance 
7l = 7^, set A^(7l, 7^) = N{jl). Because the probability of choosing the same closed geo- 
desic twice is l/vr(L) — ^ 0, this event has negligible effect on the distribution of A^(7l, 7^,) 
in the large L limit.) 



7r(L) ~ -— as L — )• 00 



(5) 
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Theorem 1.4. IfT has constant negative curvature then for some probability distribution = 
^i/i^ on R, 

(6) ^(7L,7i)-^YL^ ^ 

L 

fls L — 7- oo. 

We shall omit the proof, as it is very similar to that of Theorem ll.3l 

It is noteworthy that the order of magnitude of typical fluctuations in Theorems ll.3j - [L4l 
is L. This should be compared to the main result of [13J, which concerns fluctuations in 
self-intersection number for randomly closed curves on a surface, where sampling is by 
word length rather than geometric length. Let T be be an orientable, compact surface with 
boimdary and negative Euler characteristic X/ arid let T = Ff be its fundamental group. 
This is a free group on (7 = 2 — 2% generators. Each conjugacy class in F represents a 
free homotopy class of closed curves on T. For each such conjugacy class a there is a 
well-defined word-length L = L{a) (the minimal word length of a representative element) 
and a well-defined self-intersection count N{a) (the minimum number of transversal self- 
intersections of a closed curve in the free homotopy class). The main result of [13] states 
that if Q is randomly chosen from among all conjugacy classes with word length L then 
for certain positive constants k* , a* depending on the Euler characteristic, as L — > cx), 

where $ is the standard unit Gaussian distribution. The methods of this paper can be 
adapted to show that the main result of |fT3t extends to compact surfaces without bound- 
ary and with genus g > 2. To reconcile this with Theorem [L3] (which at first sight might 
appear to suggest that fluctuations should be on the order of L, not L^/^), observe that 
when closed geodesies are randomly chosen according to word length L, the order of 
magnitude of fluctuations in geometric length is L^/^; this is enough to increase the size 
of typical fluctuations in self-intersection count by a factor L^/^. 

For surfaces of variable negative curvature the maximum entropy measure and the Liou- 
ville measure for the geodesic flow are mutually singular, so there is no reason to expect 
any connecction with the fluctuation theory for random geodesies described by Theo- 
rem [LT] For reasons that will become clear in the proof of Theorem |1.3[ we expect that in 
fact the fluctuation theory for surfaces of variable negative curvature is entirely diffferent. 

Conjecture 1.5. if T is a compact surface of variable negative curvature, then there exist positive 
constants K*,a such that 

,8, ^^^^4^-* 

where $ is the standard unit Gaussian distribution on M. 
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1.3. Crossing intensities. There is an equivalent formulation of Conjecture 11.51 in terms 
that do not involve fluctuation theory at all, but rather the ergodic behavior of individual 
geodesies. Fix a (compact) geodesic segment a on T (for instance, a [prime] closed geo- 
desic), and for any geodesic ray 7(t) = 'y{t; x, 9) let Nt{a] 7) be the number of transversal 
intersections of a with the segment 7([0, t]). 

Proposition 1.6. Assume that T is a compact surface with a Riemannian metric of (possibly 
variable) curvature, and let fi be any ergodic, invariant probability measure for the geodesic flow 
on ST. Then for each geodesic segment a there is a positive constant K{a; n) such that for /i—a.e. 
initial vector (x, 6) the geodesic ray 7 with initial tangent vector (x, 6) satisfies 

(9) lim = K[a;fi). 

t — ^6 t 

Proof This is a straightforward application of Birkhoff 's ergodic theorem. Fix e > suffi- 
ciently small that any geodesic segment of length 2e can intersect a transversally at most 
once, and denote by G the set of all unit vectors {x,9) G ST such that the geodesic segment 
j{[—e,e], {x,0)) crosses a (transversally). Define g = {2e)~^lG where Iq is the indicator 
function of G. Then 

I / gijs)ds-Ntia;j)\ < 2; 
Jo 

the error ±2 enters only because the first and last crossing might be incorrectly counted. 
Thus, the result follows from Birkhoff 's theorem. □ 

An elementary calculation (see Lemma |Z21 below) shows that if fi = vl is normalized 
Liouville measure then for every geodesic segment a, 

(10) K{a; i^l) = Kr\a\, 

where |a| is the length of a and kx is as in relation (Note: This seems to be known - 
see, for instance, HZ), where it is formulated in different but equivalent terms.) There is no 
reason this should be true for other invariant measures, and we conjecture that isn't. 

Conjecture 1.7. If n is an ergodic, invariant probability measure for the geodesic flow such that 
the ratio K{a; /u)/|a| has the same value for all closed geodesies a, then fi = v^. 

The arguments below will show that the constancy of the ratio ^(a; vl) /|"| is responsi- 
ble for the order of magnitude of fluctuations in Theorems 11.11 and 11.31 If Conjecture 11.71 
is true, then for the maximum-entropy measure ^ = ^max the ratio «;(a; ^)/|a| would be 
constant over closed geodesies only for surfaces of constant negative curvature, since it is 
only in this case that ^max = z^L- Therefore, Conjecture ll. 71 implies Conjecture ll.5[ 

Standing Notation. Throughout the paper, p : ST — > T will denote the natural projec- 
tion from the unit tangent bundle ST to the surface T, and ^{t) = ^{t; v) will denote the 
orbit of the geodesic flow with initial tangent vector v € ST. The letter vr will be reserved 
for the semi-conjugacy of flows constructed in section |3l and (pt for the suspension flow 
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in this construction. Finally, a will be used to denote the unilateral shift on any of the 
sequence spaces S, S"*", etc. used in the symbolic dynamics. 

2. Intersection kernel 

2.1. The intersection kernel. Geodesies on any surface, regardless of its curvature, look 
locally like straight lines. Hence, for any compact surface T with smooth Riemannian 
metric there exists ^? > such that if a and /3 are geodesic segments of length < g then a 
and /3 intersect transversally, if at all, in at most one point. It follows that for any geodesic 
segment 7 of length T the self-intersection number A^(7) = Nrij) can be computed by 
partitioning 7 into nonoverlapping segments of common length 6 < g and counting the 
number of pairs that intersect transversally. Let aj and aj be two such segments; then the 
event that these segments intersect is completely determined by their initial points and 
directions, as is the angle of intersection. 

Definition 2.1. The intersection kernel : ST x ST — > M+ is the nonnegative function 
that takes the value Hs{u, v) = I if the geodesic segments of length 6 with initial tangent 
vectors u and v intersect transversally, and Hs{u, v) = otherwise. 

Assume henceforth that 6 < g; then for any geodesic 7, 

^ n n 

(11) Nt{7) = 2 EE^5(^(^'^)''^(J''^))- 

i=i j=i 

The factor of 1/2 compensates for the double-counting that results from letting both in- 
dices of summation i,j range over all n geodesic segments. The diagonal terms in this 
sum are all 0, because the segment ^{i6) does not intersect itself transversally. 

2.2. The associated integral operators. The intersection kernel Hs{u,v) is symmetric in 
its arguments and Borel measurable, but not continuous, because self-intersections can be 
created or destroyed by small perturbations of the initial vectors u, v. Nevertheless, Hs 
induces a self-adjoint integral operator on the Hilbert space L'^{vl) by 

(12) Hs^{u)= [ Hsiu,v)^iv)duLiv). 

Jvesr 

Lemma 2.2. For all sufficiently small 6 > 0, 

(13) HsHu) := j Hs{u,v)dvL{v) =S'^k 

for all u G M. Thus, the constant function 1 is an eigenfunction of the operator Hs, and conse- 
quently the normalized kernel Hs{u, v) /S'^k^ is a symmetric Markov kernel on ST x ST. 

Remark 2.3. This result (simple though it may be) is the crucial geometric property of the 
intersection kernel. Clearly, the intersection kernel induces an integral operator on L'^{fJ.) 
for any finite measure fi on ST (just replace ui by fi in the definition ([121)). But in general 



8 



STEVEN P. LALLEY 



- and in particular when fi ^ ul is a Gibbs state - the constant function 1 will not be an 
eigenfunction of this operator. 

Proof. Denote by 7 = 7([0, (5];n) the geodesic segment of length 6 with initial tangent 
vector u. For small 6 > and fixed angle 6, the set of points x G T such that a geodesic 
segment of length 6 with initial base point x intersects 7 at angle 6 is approximately a 
rhombus of side 6 with an interior angle 6, with area sin 6*1, and this approximation is 
asymptotically sharp as 5 — > 0. Consequently as 5 — > 0, 

Hs{u,v)dvL{v) ^6^ J ip{e)\sme\d9/{2TT\M\) = 6^k^, 

and the relation holds uniformly for u G ST. 

It remains to show that the approximate equality ~ is actually an equality for small 
6 > 0. Recall that for 6 < g, any two distinct geodesic segments of length 6 can intersect 
transversally at most once. Consider the geodesic segments of length 6 with initial direc- 
tion vectors u and v. For any integer m > 2, each of these segments can be partitioned 
into m non-overlapping sub-segments (each open on one end and closed on the other) of 
length 6/m. At most one pair of these constituent sub-segments can intersect; hence, 

m— 1 m—l 

Hs{u,v) = X] X] Hs/m{7{iS;u),j{j6;v)). 

1=0 j=0 

Integrating over v with respect to the Liouville measure ul and using the invariance of 
relative to the geodesic flow we obtain that 

m—l 

Hsl{u) = ^ mHs/mW{i^;u)). 
1=0 

Let m — 00 and use the approximation ((14)) (with 6 replaced hy 6/ m); since this approxi- 
mation holds uniformly it follows that Hsl{u) = □ 

Lemma 12.21 is the only result from this section that will be needed for the proofs of 
the main results. The remainder of this section is devoted to a proof of the "law of large 
numbers" (HJ and to a heuristic argument for Theorem 11.11 that is simpler and more illu- 
minating than the formal proofs that will follow. 

Lemma 2.4. For each sufficiently small 5 > 0, the integral operator Hs on L'^{i'l) is compact. 

Proof. If the kernel Hs{u, v) were jointly continuous in its arguments u, v then this would 
follow by standard results about integral operators — see, e.g., |38| . Since Hs is not con- 
tinuous, these standard results do not apply; nevertheless, the argument for compactness 
is elementary. It suffices to show that the mapping u 1— )• Hs{u,-) is continuous relative to 
the —norm. Take u,u' & ST, and let a, a' be the geodesic segments of length 6 started at 
u, u', respectively. If u, u' are close, then the geodesic segments a, a' are also close. Hence, 
for all but very small angles 9 the set of points x € M such that a geodesic segment of 
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length 6 with initial base point x intersects a at angle but does not intersect a' is small. 
Consequently, the functions Hs{u, •) and Hs{u', •) differ on a set of small measure. □ 

Lemma 12.41 implies that Hilbert-Schmidt theory (cf. [38J) applies. In particular, the 
non-zero spectrum of Hs consists of isolated real eigenvalues Xj of finite multiplicity (and 
listed according to multiplicity). The corresponding (real) eigenfunctions ipj can be chosen 
so as to consititute an orthonormal basis of L'^{vl), and the eigenvalue sequence \j is 
square-summable. 

Lemma 2.5. The kernel Hs ■= satisfies the Doeblin condition; there exist an integer 

n > 1 and a positive real number e such that 

(15) H'^(u,v)>e forall u,v £ ST, 

where H^^^ denotes the kernel of the n—fold iterated integral operator Hs- 

Proof. Chose n so large that for any two points x, y G T there is a sequence {xj}o<i<n 
of n + 1 points beginning with xq = x and ending at x„ = y, and such that each suc- 
cessive pair are at distance < 5/4. Then for any two geodesic segments a,/3 of 
length 5 on 5 there is a chain of n + 1 geodesic segments Oj, all of length 5, beginning 
aX = a and ending at q„ = (3, such that any two successive segments a, and Oj+i 
intersect transversally. Since the intersections are transversal, the initial points and di- 
rections of these segments can be jiggled slightly without undoing any of the transversal 
intersections. This implies |(T5)) . □ 

Corollary 2.6. The eigenvalue J^k^ is a simple eigenvalue of the integral operator Hs, and the 
rest of the spectrum lies in a disk of radius < 

Proof This is a standard result in the theory of Markov operators. □ 

Corollary 2.7. For every j > 2 the eigenfunction ■0j has mean zero relative to ul, and distinct 
eigenfunctions are uncorrelated. 

Proof The spectral theorem guarantees orthogonality of the eigenfunctions. The key point 
is that V'l = 1 is an eigenfunction, and so the orthogonality ipj _L ipi implies that each ipj 
for j > 2 has mean zero. □ 

Lemma 2.8. If6>0is sufficiently small then Hs has eigenvalues other than and Xi{6). 

Proof Otherwise, the Markov operator Hs would be a projection operator: for every tp G 
L^(z^i) the function Hsip would be constant. But if (5 > is small, this is obviously not the 
case. □ 

2.3. Law of large numbers. The law of large numbers (H) for random geodesies can 
be deduced from Birkhoff's ergodic theorem using the representation (|TT]) of the self- 
intersection count. The first step is to approximate the kernel Hs by continuous kernels. 
Fix < S < Q, where > is small enough that any two geodesic segments on the surface 
T of length g will intersect transversally at most once. 
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Lemma 2.9. For each e > there exist continuous functions , Hj^ : ST x 5T — > [0, 1] such 
that < Hs < Hj^ and such that for each u G ST, 

(16) I {Hj{u,v) - H^{u,v))dMv) < e. 

Proof Fix e' > such that 5 + 2e' < q, and let ip : [0,1] — > [0, 1] be a continuous function 
such that -0(0) = "0(1) = ^rid ip = 1 on the interval [e' , 1 — e']. For unit tangent vectors 
u,v e ST such that the geodesic segments 7^, 7^, of length 6 based at u, v intersect at angle 
9 G (0, vr) at times G [0, 6], set 

and for all other u,v set Hj{u,v) = 0. Similarly, for unit tangent vectors u, u G ST 
such that the geodesic segments 7m,7„ of length 6 + 2e' based at u,v intersect at times 

tu.'tv G + e'), set 

F+(n, v) = ^{tu/{8 + 2e')^{t,/{5 + 2e')), 

and for all other u, v set H'^{u, v) = 0. Clearly, < < Hs < H'^ , and by an argument 
like that in the proof of Lemma l2!2l it can be shown that if e' > is sufficiently small then 
CSll will hold for all u. □ 

Proposition 2.10. Let {X,d) he a compact metric space and let K : X"^ ^ M. he continuous. 
If p. is a Borel probability measure on X and T : X ^ X is an ergodic, measure-preserving 
transformation (not necessarily continuous) relative to p, then 

(17) 1™ ^ V V K{rx, Px) = [[ K{y, z) dp{y)dp{z) 
for p— almost every x. 

Proof. The function K is bounded, since it is continuous, so the double integral in ((17)| is 
well-defined and finite. Furthermore, the set of functions defined by Kx{y) := K{x, y), 
where x ranges over the space X, is equicontinuous, and the function 



:= / K^{y)dp{y) 
Jx 



is continuous in x. The equicontinuity of the functions implies, by the Arzela-Ascoli 
theorem, that for any e > there is a finite subset = {xj}i<j</ such that for any x G SM 
there is at least one index i < I such that 

1 1 Kx 1 1 00 ^ £■ • 



It follows that the time average of Kx along any trajectory differs from the corresponding 
time average of Kx^ by less than e. Since the set is finite, Birkhoff 's theorem implies 
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1 " r 

lim - "S^ K{y,T^ x) = K{y, x') dfi{x') for each y G F^. 

Consequently, it follows from equicontinuity (let e — )■ 0) and the continuity in x of the 
averages K,j. that almost surely 



1 " r 

lim - Vi^(y,r^x) = / Kiy, 



x) d^{y) 



uniformly for all y G X. The uniformity of this convergence guarantees that (flT)) holds 
/i— almost surely. □ 

Proof of the strong law of large numbers ^ . Let H'^ and HJ be as in the statement of Lemma l2!9l 
By Proposition l2.10[ for v^ — almost every u e ST, 

^ n n „ 

^™ ^y]y]-f^^(7«(^'^),7M(i'5)) = / Hf{v,w)dvL{v)dvL{w). 

n— >-cxD n ^ — ' ^ — ' I 
i=l j=l 

Hence, by Lemma IZ91 (let e' — 0) and Lemma IZ2l for v^— almost every u G ST, 

lim V'-H'5(7n(i'5),7n(j<J)) = / Hs{v,w)duL{v)duL{w) = 5^K. 

j=l j=l 

This proves that ^ holds for t — oo along the sequence t = n6. Since > is arbitrary, 
and since the self-intersection counts are obviously monotone in t, relation ^ holds for 
t — oo through the reals. □ 



2.4. Weak convergence: heuristics. The results of sections 12.11 - 42.21 can be used to give 
a compelling — but non-rigorous — explanation of the weak convergence asserted in 
Theorem ll.il The Hilbert-Schmidt theorem asserts that a symmetric integral kernel in 
the class L'^{vl x i^l) has an L^— convergent eigenfunction expansion. The intersection 
kernel Hs{u, v) meets the requirements of this theorem, and so its eigenfunction expansion 
converges in L'^{ul x ul): 

oo 

(18) Hs{u,v) = y^^\k^pk{u)'>Pk{v)■ 

k=l 

The L^— convergence of the series does not, of course, imply pointwise convergence; this is 
why the following argument is not a proof. Nevertheless, let's proceed formally, ignoring 
convergence issues. Recall (Corollary I2.7|) that the eigenfunctions are mutually uncorre- 
lated, and so all except the constant eigenfunction V'l have mean zero relative to u^. Thus, 



12 



STEVEN P. LALLEY 



the representation ([TT]) of the intersection number N^{n5) can be rewritten as follows: 



^ n n 

(19) N^{n5) - {ndfng = - J] i/^lTl^^^), tO'^)) " {nSfn^ 

i=i j=i 

^ n n oo 

i=l j=l k=2 
^ oo / " \ ^ 



2 

fc=2 \i=l 



If the eigenfunctions tpj were Holder continuous, the central limit theorem for the geo- 
desic flow l|31l| would imply that for any finite K the joint distribution of the random 
vector 



/ n 

(20) i^Y^^kms)) 



2<k<K 



converges, as n — )• oo, to a (possibly degenerate) /C— variate Gaussian distribution cen- 
tered at the origin. (The central limit theorem in [31 J is stated only for the case K = \, but 
the general case follows by standard weak convergence arguments [the "Cramer-Wold 
device"], as in [4J, ch. 1.) Hence, for every K < oo the distribution of the truncated sum 



K n Y 

(21) -Y.^m\Y.'^mm\ 

k=2 \i=l J 

should converge, as n — oo, to that of a quadratic form in the entries of the limiting 
Gaussian distribution. 

Unfortunately, it seems that there is no way to make this argument rigorous, because 
there is no obvious way to show that the series ((18)) converges pointwise. (If the kernel Hs 
were normegative semi-definite then Mercer's theorem might be applied, in conjunction 
with a smoothing argument; however, Hs is in general not normegative semi-definite.) 
Thus, it will be necessary to proceed by a more indirect route, via symbolic dynamics and 
thermodynamic formalism. 



This would follow by the spectral theorem for symmetric matrices and elementary properties of the 
multivariate Gaussian distribution. To see this, suppose that the limit distribution of the random vector (20) 
is mean-zero Gaussian with (possibly degenerate) co variance matrix E; this distribution is the same as that 
of Ti^''^Z, where Z is a Gaussian random vector with mean zero and identity covariance matrix. Let A be the 
diagonal matrix with diagonal entries \j{S). Then the limit distribution of (2\\ is identical to that of MZ, 
where M = E^/^AE^''^. But the matrix M is symmetric, so it may be factored as M — DU, where U is an 
orthogonal matrix and D is diagonal. Now if Z is mean-zero Gaussian with the identity covariance matrix, 
then so is UZ, since U is orthogonal. Thus, Z^MZ is a quadratic form in independent, standard normal 
random variables. 
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3. Symbolic dynamics 

3.1. Shifts and suspension flows. Symbolic dynamics provides an approach to the study 
of hyperbolic flows that transforms questions about orbits of the flow to equivalent (or 

nearly equivalent) questions about a shift of finite type. The shift of finite type (either one- 
sided or two-sided) on a finite alphabet A with transition matrix ^4 : ^ x ^ ^ {0, 1} is the 
system (S"*", a) or (S, a) where 

S+ = {{xn)n>o G I A{xn,xn+i) = 1 V n > 0} and 

S = {ixn)nez G A^ I A{Xn, x^+i) = 1 V n G Z}, 

and a is the forward shift operator on sequences. (Equivalently, one can define a shift of 
finite type to be the forward shift operator acting on the space of all sequences with entries 
in ^ in which certain sub words of a certain length r do not occur.) If there exists m > 1 
such that A"^ has strictly positive entries then the corresponding shifts are topologically 
mixing relative to the usual topology on S or S+, which is metrizable by 

(i(a;, y) = 2-"(^'^\ 

where n{x, y) is the minimum nonnegative integer n such that Xj / yj for j = ±ra. 

For a continuous function F : S ^ (0, oo) on S (or on S+), the suspension flow under F 
is a flow on the space 

Sf := {(x, i) : x G S and < i < F{x)} 

with points (x,F(x)) and (ax, 0) identified. In the suspension flow an orbit beginning 
at some point (x, s) moves at unit speed up the fiber over (x, 0) until reaching the roof 
(x, F{x)), at which time it jumps to the point {ax, 0) and then proceeds up the fiber over 
(crx, 0), that is 

(f)t{x,s) = {x,s + t) iis + t<F{x); 
(t)t{x, s) = 4>t-F(x)+s{<^x, 0) otiierwise. 

We shall use the notation 

■ = {(,x,s) : s G [0,F(x))} and 
: = M^x) 

for fibers and their time shifts. An orbit of the suspension flow is periodic if and only if it 
passes through a point (x, 0) such that x is a periodic sequence; if the minimal period of 
the sequence x is n, then the minimal period of the corresponding periodic orbit of (pt is 
the sum of the lengths of the fibers visited by the orbit, which is given by 

n— 1 

(22) 5„F(x) := J]F(c7^x). 

j=0 

The term "symbolic dynamics" is used loosely to denote a coding of orbits of a flow by 
elements x G S of a shift of finite type. In the case of a hyperbolic flow, this coding extends 
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to a Holder cor\tinuou^ semi-conjugacy with a suspension flow over a shift of finite type 
with a Holder continuous height function F. Existence of such semi-conjugacies was 
proved in general for Axiom A flows by Bowen ||9J, and by Ratner 1321 for geodesic flows 
on negatively curved surfaces. 

Proposition 3.1. (Ratner; Bowen) For the geodesic flow on any compact surface T with a 
Riemannian metric of negative curvature there exists a suspension flow {Y,F,4't) over a shift of 
flnite type and a Holder continuous surjection vr : Si? — > ST such that 

(23) vr o = 7i o vr for all t e M. 

The suspension flow {T,f, (f)t) and the projection vr can be chosen in such a way that the following 
properties hold: 

(A) For some N < oo, the mapping it is at most N—to—1. 

(B) The suspension flow and the geodesic flow have the same topological entropy 9 > 0. 

(C) For some e > 0, 

(i) the projection vr is n— almost surely one-to-one for every Gibbs state /i with entropy 
larger than 9 — e, and 

(ii) The number M{t) of closed geodesies with more than one 7r—pre-image and prime 
period < t satisfles 

limsupt"MogM(t) <9-e. 

t—>oo 

For the definition of a Gibbs state, see section |4] below; both the Liouville measure and 
the maximum entropy invariant measure are Gibbs states. The conclusions (C)-(i) and (C)- 
(ii) are not explicitly stated in IfTOll , but both follow from Bowen's construction. See Il27| . 
sec. 3 for further discussion of this point. Finally, observe that if tt is a semi-conjugacy as 
in Proposition |3]ll then so is the mapping tTs = vr o 0^, for any s G M. (See Remark l3.8l for 
implications of this.) 

3.2. Series' construction. For the geodesic flow on a compact surface of constant negative 
curvature, a different symbolic dynamics was constructed by Series [35J (see also | |TT| , and 
for related constructions 111 and l|22ll ). This construction is better suited to enumeration 
of self-intersections. In this section we give a resume of some of the important features of 
Series' construction. 

Assume first that T has constant curvature —1 and genus g > 2. Then the universal 
covering space of T is the hyperbolic plane D, realized as the unit disk with the usual 
(Poincare) metric. The fundamental group T = vri(T) is a discrete, finitely generated, co- 
compact group of isometries of D. Thus, T can be identified with D/T. This in turn can 
be identified with a fundamental polygon V, with compact closure in D, whose sides are 
geodesic segments in D that are paired by elements in a (symmetric) generating set for 

^The implied metric on the suspension space Sf is the "taxicab" metric induced by the flow (f>t and the 
metric d on E specified above - see Il2l for details. The metric on ST is the metric induced by the Riemannian 
metric on TT - see, e.g., |29J, sec. 



SELF-INTERSECTIONS OF CLOSED GEODESICS 



15 



r. The polygon V can be chosen in such a way that the even corners condition is satisfied: 
that is, each geodesic arc in dV extends to a complete geodesic in D that is completely 
contained in Ug^rgidV). The geodesic lines in Ug^rg{dV) project to closed geodesies in 
T; because the polygon V has only finitely many sides, there are only finitely many such 
projections. Call these the boundary geodesies. 

Except for those vectors tangent to one of the boundary geodesies, each unit tangent 
vector V G ST can be uniquely lifted to the unit tangent bundle SO of the hyperbolic 
plane in such a way that either the lifted vector L{v) has base point in the interior of V, 
or lies on the boundary of V but points into V. The vector L{v) uniquely determines a 
geodesic line in D, with initial tangent vector L{v), which converges to distinct points on 
the circle at infinity as t ^ ±00. The mapping L : ST — SO is smooth except at those 
vectors that lift to vectors tangent to one of the boundary geodesies; at these vectors, L 
is necessarily discontinuous. Denote by H C ST the set of all vectors v such that L{v) is 
based at a point on the boundary of V. 

For any shift (S^, cr), any x G S or S^, and any subset J C let T,j{x) be the cylinder 
set consisting of all y G such that xj = i/j for all j G J. For any sequence x G S, denote 
by 

= X0X1X2 ■ ■ ■ and X = x_ix_2X_3 • • • 
the forward and backward coordinate subsequences. The sequence x~ need not be an 
element of since its coordinates are reversed. Let be the set of all x~ such that 
x G S; then a) is a shift of finite type (whose transition matrix is the transpose of 
A). 

Proposition 3.2. (Series) Let T he a compact surface equipped with a Riemannian metric of con- 
stant curvature —1. There exist a shift (S, a) of finite type, a suspension flow (T,f, <t>t) over the 
shift, and surjective. Holder-continuous mappings ^± : T,^ — > 3D, and vr : S^? — )• 5T such that 
IT is a semi-conjugacy with the geodesic flow (i.e., equation ((23)) holds), and such that the following 
properties hold. 

(A) H = 7r(S X {0}). 

(B) The endpoints on dB> of the geodesic with initial tangent vector L o 7r(x, 0) are (,±{x^). 

(C) F{x) is the time taken by this geodesic line to cross V. 

Furthermore, the maps ^± send cylinder sets onto closed arcs J^{x^) such that for 

certain constants C < 00 and < /3i < /32 < 1 independent ofm and x, 

(D) the lengths of J^{x^) are between C/3™ and C(3^, and 

(E) distinct arcs J^{x~^) and Jm{y~^) of the same generation m have disjoint interiors (and 
similarly when + is replaced by —). 

Consequently, the semi-conjugacy it fails to be one-to-one only for geodesies whose lifts to D have 
at least one endpoint that is an endpoint of some arc J^i^)- Finally, all but finitely many closed 
geodesies (the boundary geodesies) correspond uniquely to periodic orbits of the suspension flow, 
and for each nonexceptional closed geodesic the length of the representative sequence in T, is the 
word length of the free homotopy class relative to the standard generators o/7ri(T). 
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See ||35|, especially Th. 3.1, and also ITTTI . The last point is important because it implies 
that the set of geodesies where the semi-conjugacy fails to be bijective is of first category, 
and has measure zero under any Gibbs state (in particular, under the Liouville and maxi- 
mum entropy measures). 

Series' construction relies heavily on the hypothesis that the underlying metric on T is 
of constant negative curvature. However, the key features of her construction carry over 
to metrics of variable negative curvature, by virtue of the conformal equivalence theorem (see, 
for instance, [34J, ch. V) for negatively curved Riemannian metrics on surfaces and the 
structural stability theorem for Anosov flows 111, Il26l , ||14| . Structural stability applies only 
to small perturbations of Anosov flows, and only geodesic flows on negatively curved 
surfaces are Anosov, so to use structural stability globally for geodesic flows we must be 
able to show that there is a deformation (homotopy) taking one Riemannian metric to an- 
other, through metrics of negative curvature The following easy proposition (undoubtedly 
well known, but not explicitly stated in [Ml) shows that for surfaces, conformal equiva- 
lence of negatively curved metrics implies the existence of a smooth deformation. 

Proposition 3.3. Let qq , qi be C°° Riemannian metrics on T, both with everywhere negative 
scalar curvatures. Then there exists a C°° deformation {Qt}telo,i] through Riemannian metrics 
with everywhere negative scalar curvatures. 

Proof. The conformal equivalence theorem (||34|. ch. V, Th. 1.3) implies that there exists 
a strictly positive C°° function r = e^" on T such that qi = tqq. The scalar curvatures 
Kq, Ki are related by the equation 

Ki = e-'^'^{KQ - 2 An) 

where A is the Laplace-Beltrami operator with respect to qq. Since Kq and Ki are both 
negative everywhere, it follows that JTo—t An < Oforeveryt G [0,1]. Thus, if := e~^*"^o 
then the curvature Kt = e~^*"(ii'o — tAn) is everywhere negative, for every t. □ 

Fix Riemannian metrics arid qi of negative curvature on T such that qq has constant 
curvature -1, and let Qs be a smooth deformation as in Proposition 13.31 The geodesic 
flow on ST with respect to any Riemannian metric Qs of negative curvature is Anosov, 
and if the metrics Qs vary smoothly with s then so do the vector fields of their geodesic 
flows. Hence, by the structural stability theorem, for each s G [0, 1] there exists a Holder 
continuous homeomorphism <I>s : ST — > ST that maps ^jq— geodesies to geodesies. 
The Holder exponent is constant in s, and the homeomorphisms <I>s vary smoothly with 
s in the Holder topology [14J. Consequently, the homotopy lifts to a homotopy '■ 
5B — )• 5D of Holder continuous homeomorphisms of the universal covering space. Each 
homeomorphism $s rnaps ^iq— geodesies to geodesies, and for each ^jq— geodesic 7 
the corresponding Qs— geodesic ^s{l) converges to the same endpoints on the circle at 
infinity as does 7. 
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Series' construction gives a semi-conjugacy vro of a suspension flow {TiFg,cl)t) with the 
£»o— geodesic flow on ST that is nearly one-to-one in the senses described in Proposi- 
tion 13.21 We have just seen that there is a homotopy of Holder continuous homeomor- 
phisms <I>s : ST — > 5T such that each maps ^^q— geodesies to ^^s— geodesies. Set <I> = ^i; 
because <I> is Holder, it lifts to the suspension flow: in particular, there exist Holder con- 
tinuous Fi : S — )• (0, oo) and vri : Si?^ — )• ST and a Holder continuous homeomorphism 
^ : T,Fo ^ ^Fi that maps fibers of S homeomorphically onto fibers of S^i / and satisfies 
the conditions 

(24) ^'(x,0) = (x,0) for every X G S, and 

(25) TTi o ^1/ = $ O TTq. 

Thus, the projection vri : Si?^ — )- ST is a semi-conjugacy between the suspension flow on 
S and the geodesic flow on ST relative to the metric . 

Corollary 3.4. For any negative-curvature Riemannian metric qi on a compact surface T the 
suspension flow {TiFi-,(l>t) semi-conjugacy vri : S^i — ^ ST in Proposition 13.11 can he chosen 
in such a way that tt is one-to-one except on a set of first category, and only finitely many closed 
geodesies have more than one pre-image. 

3.3. Symbolic dynamics and self-intersection counts. For surfaces of constant curva- 
ture — 1 the symbolic dynamics has the convenient property that the geodesic segments 
p o Tr{J^x) and p o -ir{Ty) corresponding to two distinct fibers of the suspension flow can in- 
tersect at most once, since each is a single crossing of the fundamental polygon. Unfortu- 
nately, this property does not necessarily hold for surfaces of variable negative curvature. 
Since will be easiest for us to count self-intersections of a long geodesic by counting pairs 
of fibers whose images cross, we begin by recording a simple modification of the symbolic 
dynamics that guarantees only single crossings. 

Lemma 3.5. Given any sufficiently small e > we can assume, without loss of generality, that 
the suspension flow has been chosen in such a way that the height function F satisfies < F < e. 

Proof. This can be arranged by a simple refinement of the symbolic dynamics constructed 
above. First, consider the suspension flow obtained by cutting the sections of the flow 
space lying above particular initial symbols xq = a into boxes, refining the alphabet so 
that there is one symbol per box, and adjusting the transition rule and the height func- 
tion accordingly. In detail, fix an integer m > 1, replace the original alphabet A by the 
augmented alphabet A* := A x [m], where [m] = {1, 2, m}, and replace the transition 
matrix A by the matrix A* defined by 



Define the shift (S*, cr*) on the enlarged alphabet, with transition matrix A*, accordingly. 
Let : ^* — ^ be the projection on the first coordinate, and v* : T,* ^ T, the induced 



A*{{a,j),{a',j')) = l 



= 



= 1 



if a = a' and / = j + 1 < rn; 
if A{a,a') = 1 and j' = l,j = m; 
otherwise. 
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projection of the corresponding sequence spaces. Finally, define F* : S* — )- (0, oo) by 

F*{x*) = F{u*{x*))/m. Then the mapping p* : T,*p^ — > T^p defined by 

= {x,s + {j-l)F*{x))- 

provides a conjugacy between the suspension flow (S^^, </>*) with [T^p, (pt). By choosing 
m large we can arrange that F* < e. 

Unfortunately, this construction introduces periodicity into the underlying shift (S* , a). 
This is a nuisance, because the basic results of thermodynamic formalism [IQl, ||28H that 
we will need later, including the central limit theorem ||3T| , require that the underlying 
shift be topologically mixing. But a simple modification of the foregoing construction can 
be used to destroy the periodicity. Choose one symbol a* G A, and for this symbol only, 
cut the section of the flow space T^p over a* into m + 1 boxes, instead of the m used 
in the construction above. Adjust the transition rule A* , the height function F, and the 
projection mapping p* in the obvious manner to obtain a suspension flow conjugate to 
the original flow. The underlying shift for this modified suspension will be aperiodic, by 
virtue of the fact that the original shift (S, a) is aperiodic. 

Observe that in both of these constructions, the cylinder sets of the modified shift 
(S*,(T*) are contained in cylinder sets of (S,cr) of comparable length (i.e., within a fac- 
tor m + 1). Since in Series' symbolic dynamics cylinder sets of length n correspond to 
boundary arcs in 51} with lengths exponentially decaying in n, the same will be true for 
the modified symbolic dynamics. 

□ 

By virtue of this lemma, we can assume without loss of generality that the suspension 
flow has been chosen in such a way that the images (under the projection p o vr) of any two 
distinct fibers Fx and Fy intersect at most once in T. Thus, the number of self -intersections 
of any geodesic segment can be computed by partitioning the segment into the images of 
successive fibers (the first and last segment will only represent partial fibers) and counting 
how many pairs intersect. With this in mind, define /i : S x S — {0, 1} by setting h{x, y) = 
1 if the fibers Fx and Fy over x and y project to geodesic segments on T that intersect 
(transversally), and h{x, y) = if not. Clearly, for any periodic sequence x G S with 
minimal period m the image (under p o vr) of the periodic orbit of the suspension flow 
containing the point (x, 0) will be a closed geodesic 7 with self -intersection count 

^ m m 

(26) iV(7) = -^j;/i(a^x,a^x). 

i=i j=i 

The function h is not continuous, because a small change in the endpoints or direc- 
tions of two intersecting geodesic segments can destroy the intersection. Nevertheless, for 
"most" sequences x, y G S, a "small" number of coordinates Xj , yj will determine whether 
or not the geodesic segments corresponding to the fibers Fx and Fy intersect. Here is a 
way to make this precise. For each m>\ and each sequence x G S, denote by (x) 
the cylinder set consisting of all y G S that agree with x in all coordinates —m<j<m. 
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For any two sequences x, y G S such that the geodesic segments p o t:{Fx) and p o ^{Ty) 
intersect, let m{x,y) be the least nonnegative extended integer with the following prop- 
erty: for every pair of sequences x\y' such that x' G T.y_rn,m]{^) and y' € the 
geodesic segmentspo7r(J'^./) andpo7r(7"j^/) intersect, where -F^,/ and J^/ are the fibers of the 
suspension space over x' and y' , respectively. For sequences x, y such that the segments 
p o 7r{Tx) and p o 7r{Ty) do not intersect, set m(x, y) = —1. Define 

(27) hm{x,y) = l if m(x,y) = m > 0; 

= otherwise. 

Lemma 3.6. The function h decomposes as 

oo 

(28) h{x,y) = hm{x,y) + hoo{x,y), 

m=0 

where hoo{x,y) ^ {in which case hoo{x,y) = I) only if the geodesic segments p o ^{Tx) 
and p o 7r(Jy) intersect at an endpoint of one of the two segments. The functions km satisfy the 
following properties: 

(Al) hm{x, y) depends only on the coordinates Xi, yi with \i\ < m; 
{AD hm{x, y) / 0/or at most one m; and 

{A3) for some Q < q < \ and C < oo not depending on n, ifJ2m>n ^m{x, y) + hoo{x, y) ^ 
then the geodesies corresponding to the orbits of the suspension flow through {x,0) and 
{y, 0) intersect either 
{a) at an angle less than Cfp , or 

{b) at a point at distance less than Cg^ from one of the endpoints of one of the segments 
po'K{Tx) orpo7r{Ty). 

Proof. Only statement (A3) is nontrivial. Since the projection vr : S^? — )• ST is Holder con- 
tinuous, it suffices to prove that for all large n, if two geodesic segments intersect at an an- 
gle larger than Cg"' and at a point at distance greater than Cg"^ from any of the endpoints, 
then so will two geodesic segments of the same lengths whose initial points / directions are 
within distance Cg"^^ of the initial points /directions of the original pair of geodesic seg- 
ments. This holds because at small distances, geodesic segments on T look like straight 
line segments in the tangent space (to the intersection point). □ 

A formula similar to ((26)) holds for the self-intersection count N{t) = N{t; 7) of an 
arbitrary geodesic segment j[0,t] (which, in general, will not be a closed curve.) As in 
the case of closed geodesies, the self-intersection count can be computed by partitioning 
the segment into the images of successive fibers and counting how many pairs intersect. 
However, for arbitrary geodesic segments, the first and last segment will only represent 
partial fibers, and so intersections with these must be counted accordingly. 

For X, y G S and < s < F{x), define go{s, x, y) to be 1 if the geodesic segment corre- 
sponding to the fiber Ty intersects the segment corresponding to the partial fiber 

{(x,t) : < t < s}, 
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and if not. Similarly, define gi{s, x, y) to be 1 if the geodesic segment corresponding to 
the fiber Ty intersects the segment corresponding to the partial fiber 

{{x,t) : s<t< F{x)} 

and if not. If 7 is the geodesic ray whose initial tangent vector is p o 7r(x, s), then the 

self-intersection count for the geodesic segment 7[0, t] is given by 

(29) 

^ T T T T 

N{t; 7) = - ^ ^ h{a^x, a^x) — ''^go{s, x, a^x) — ''^gi{Sr+iF{x) — t+s, a'^x, cr*x)± error 

i=0 j=l i=l 2=0 

where 

(30) T = Tt{x) = min{n > : Sn+iF{x) > t}. 

The error term accounts for possible intersections between the initial and final segments, 
and hence is bounded in magnitude by 2. Because it is bounded, it has no effect on the 
distribution of {N{t) — Kt^)/t in the large— t limit. Note that whereas the first double sum 
in ((29)) will have magnitude 0{t^) for large t, each of the single sums will have magnitude 
0{t). Since the order of magnitude of the fluctuations in (|2]| is 0{T), it follows that the 
single sums in ((29)) will have an appreciable effect on the distribution of the normalized 
self-intersection counts in (|2]). 

The following proposition summarizes the key features of the construction. 

Proposition 3.7. For any compact surface of constant curvature —1 there exists a topologically 
mixing shift (S, a) of finite type, a Holder continuous height function F : S — > ]R_|_, and functions 
/im : 5] X S — ^ [0, 1] satisfying (A1)-(A3) of Lemma \3.6\ such that 

(SD-1) with only finitely many exceptions, each prime closed geodesic corresponds uniquely to a 
necklace; 

(SD-2) the length of each such closed geodesic is SnF{x), where n is the minimal period of the 
necklace x; and 

( SD-3) the number of self-intersections of any such closed geodesic is given by (|26)l , with h defined 
by^. 

Furthermore, there exists a semi-conjugacy of the symbolic flow (f)t onJ^p '^Hh the geodesic flow 
that is one-to-one except on a set of first category. For any geodesic 7, the number N{t; 7) of 
self-intersections of the segment ^[0, t] is given by ((29)) , with the error bounded by 2. 

Remark 3.8. The choice of the Poincare section in the construction of the suspension flow 
is important because it determines the locations of the discontinuities of the function h in 
the representation l(26)l , which in turn determines how well h{x, y) can be represented by 
functions that depend on only finitely many coordinates of x and y (cf. property (A3) in 
Lemma |3!6)) . This choice is somewhat arbitrary; other Poincare sections can be obtained 
in a number of ways, the simplest of which is by moving points of the original section 
forward a distance r along the flow lines (equivalently, replacing the semi-conjugacy vr of 
Proposition 13. 1 I by vr^ = tt o 0^). This has the effect of changing the function h as follows: 
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define /i^' : S x S — ^ {0, 1} by setting h'''{x, y) = 1 if thepovr— projections of the suspension 
flow segments 

(31) := {4's{x)}r<s<F(x)+r and := {(l)siy)}r<s<F{y)+r 

intersect transversally on T, and h^{x,y) = if not. Clearly, the representation ((26)) for 
the self -inter section counts of closed geodesies remains valid with h replaced by any K' . 
Similarly, the representation (|2]) for the self-intersection count of an arbitrary geodesic 
segment will hold when h, go, and gi are replaced by h''', g^, and g'l, where gl are the 
obvious modifications of gi. The function K'' can be decomposed as 

oo 

(32) /i''=E^- + ^~ 

m=0 

where hl^^{x,y) and h^{x,y) are defined in analogous fashion to the functions hm and 
/loo above, in particular, hl^{x,y) = 1 if m is the smallest positive integer such that 
h^{x', y') = 1 for all pairs x', y' that agree with x, y in coordinates \ j\ < m, and hlj^{x, y) = 
otherwise. These functions once again satisfy (A1)-(A2) of Lemma l3!6l Property (A3) is re- 
placed by the following (A3)': if J2n>m ^n(^5 v) + ^^(^i y) / then the geodesic segments 
corresponding to the suspension flow segments and Ty intersect either 

(a) at an angle less than C or 

(b) at a point at distance < Cg"^ from one of the endpoints of p o 7r(J^J.■) or p o T:{Fy). 

4. GiBBS STATES AND THERMODYNAMIC FORMALISM 

4.1. Standing Conventions. We shall adhere (mostly) to the notation and terminology of 
Bowen IITOl . However, we shall suppress the dependence of various objects on the transi- 
tion matrix A of the underlying subshift of finite type, since this will be fixed throughout 
the paper: thus, the spaces of one-sided and two-sided sequences will be denoted by 
S+ and S, respectively, and the spaces of a— Holder continuous real-valued functions on 
these sequence spaces by and F. (With one exception [sec. 14.211 the Holder exponent 
a will also be fixed throughout the paper, so henceforth we shall refer to a— Holder func- 
tions as Holder functions.) The spaces T = and = are Banach spaces with 
norm 

11/11 = ll/IU = l/U + ll/lloo where 
|/U = sup sup |/(x) - /(y)|/a". 

n->0 x,y : Xj=yj V|j|<n 

For any sequence x G S or x G S"*" and any interval J = {k, A; + 1}, . . . , / of Z or 
N denote by x j or x(J) the subsequence x^x^+i • • • x/, and let Sj(x) (or Sj (x)) be the 
cylinder set consisting of all sequences y G S such that y{J) = x{J). For any n G N 
let [n] = {1,2, .. . ,n}. For any continuous, real-valued function Lp and any probability 
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measure A on S or S+ denote hy Ex^f = f 'fdX the expectation of (p with respect to A, by 
Pr((^) the topological pressure of if, and for each interval J C Z write 

ieJ 

Following Bowen we shall also write Sn^ = S[n]^ o = X^^Jq^ ip o for any integer 
n > 1. 



4.2. Gibbs states. For each real-valued function (p £ T there is a unique Gibbs state fi = 
H^, which is by definition a shift-invariant probability measure /i on S for which there are 
constants < Ci < C2 < oo such that for every finite interval J C Z, 

^^^^ - exp{5,^(x) - UIPrM} " 

for all a; G S. When the underlying shift (S, cr) is topologically mixing - as we shall as- 
sume throughout - every Gibbs state is mixing (and therefore ergodic), and has positive 
entropy. Consequently, there exists a < 1 such that for every n > I, all cylinder sets 
S[o of generation n have ^—probabilities less than a". Moreover, correlations decay 
exponentially, in the following sense. For any subset J C Z, let Bj be the a— algebra of 
Borel subsets G of S whose indicator functions Ic depend only on the coordinates n £ J. 
Then for each Gibbs state /i there exist constants C < 00 and < /3 < 1 such that for each 
n > 1, 



(34) n G') - KG)KG')\ < C/3XG)^(G') V G G i3(,oo,o], G' G B[^, 



00) • 



This implies that for any specification of the "past" . . . uj-iujq, the conditional distribution 
of the "future" • • • differs from the unconditional distribution by at most 2C/3" in 

total variation norm. The exponential mixing property can be expressed in the following 
equivalent form (see ||28j , pp. 29-30): for any two q— Holder functions v, w such that 

Ef,w = 0, 

(35) \E^v{wo a'')\<C /S^'MoolMo, 

where ||t(;||Q, is the Holder norm of w. 

The uniform mixing property l|34]l implies that it is unlikely that a random sequence 
X G E chosen according to the law of a Gibbs state will have long repeating strings at 
fixed locations. This is made precise in the next lemma; it will be used in section [6] below 
(cf. Lemma 16^7)1 to show that self-intersections at very small angles are highly unlikely. 

Lemma 4.1. For any Gibbs state n there exists < /3 < 1 such that for all k ^ and all 
sufficiently large m > 1, 

(36) G S : Xi = Xi+k for all < z < m} < 
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Proof. The mixing inequality ((34)) and a— invariance of /x imply that there exist an integer 
L > 1 and < /3 < 1 such that for every L' > L and every symbol a G ^ of the underlying 
alphabet, 

K^L'+n = a I ^(_oo,n]) < P for all n G Z 
We shall consider two separate cases: first, k > L; and second, 1 < k < L. (Since every 
Gibbs state is cr— invariant, it suffices to consider only positive values of k.) In the first 
case, 

= Xk+i V 1 < i < nL) < n{xiL = Xk+iL V 1 < z < n) 

n 

= -^M n/^(^«i = ^k+iL I ^(-oo,A:+(j-l)L]) 
i=l 

In the case I < k < L, there will be a multiple of k in every interval of length L, so we 
can choose rrii < m2 < • • • < m„ such that jL < mjk < jL + L for each j < n. Now 
the requirement that Xi = Xj+fc for all 1 < i < nL forces xq = Xmjk for every j < n; 
consequently, 

H{xi = Xk+i V 1 < i < nL) < fi{xo = Xm^k V 1 < j < n) 

[n/2] 

^ n ^^(^rn2jk = Xq I S(-oo,m2j-2A:]) 

□ 

Two functions ip,ip ^ T are said to be cohomologous if their difference is a cocycle u—uoa, 
with n G J". If and are cohomologous then //(^ = and Pr((/3) = Pr(V'). Accord- 
ing to a theorem of Livsic ( IfTOl , Lemma 1.6), for every a— Holder function there exist 
y^— Holder functions , y?" both cohomologous to (p (and therefore mutually cohomol- 
ogous) such that {x) depends only on the forward coordinates xi , X2 , . . . of x and (x) 
depends only on the backward coordinates xq, x_i, 

For any function Lp G the Gibbs state is related to the Perron-Frobenius eigen- 
function and eigenmeasure i^^ of the Ruelle operator C^p : — > associated with 
(cf. 1 10 1, ch. 1, sec. C). In particular, if /i^ and v^p are normalized so that u^, and /ii^J^i^ both 
have total mass 1, and if A,^ is the Perron-Frobenius eigenvalue, then 

(37) d/i<^ = h^dv^ and = exp{Pr(v?)}. 

4.3. Suspensions. Say that a function f ^ F (or J^+) is nonarithmetic if there is no function 
g & F valued in a discrete additive subgroup of M to which / is cohomologous. \i F ^ F 
is strictly positive then the suspension flow with height function F is topologically mixing 
if and only if F is nonarithmetic. This is the case, in particular, for the suspension flow 
discussed in section|3l 
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Assume henceforth that F is a strictly positive, nonarithmetic. Holder-continuous func- 
tion on S, and let Tjphe the corresponding suspension space. For each a— invariant prob- 
ability measure /^i on S define the suspension of /i relative to F to be the flow-invariant 
probability measure fi* on with cylinder probabilities 

(38) ^ (S[„](a;) x [0,a\) = j p 

for any cylinder set and any a > such that a < F on (Here and else- 

where we use the notation [n] to denote the set of integers {1, 2, n}.) For the geodesic 
flow on a compact, negatively curved surface, both the Liouville measure and the maxi- 
mum entropy measure lift to the suspensions of Gibbs states; for the maximum entropy 
measure, the corresponding Gibbs state is fi^gp where ^ > is the unique real number 
such that Fr{—9F) = 0, and this value of 9 is the topological entropy of the flow (cf. HI, 
also [20]). If the surface has constant negative curvature then the Liouville and maximum 
entropy measures are the same, but if the surface has variable negative curvature then the 
Liouville measure is mutually singular with the maximum entropy measure, and so the 
potential function for the corresponding Gibbs state is not cohomologous to —6F. This 
is what accounts for the difference between constant and variable negative curvature in 
Theorem ll.3[ 

If the suspension flow is topologically mixing then the suspension of any Gibbs state is 
mixing for the flow. This fact is equivalent to a renewal theorem, which can be formulated 
as follows. For each T e M+ and x G S define 

(39) r(x) = Trix) = mm{n > 1 : SnF{x) > T} and Rrix) = Sr{^)F{x) - T. 

Proposition 4.2. Assume that the shift (S, a) is topologically mixing, and that F ^ F is positive 
and nonarithmetic. Then for any Gibbs state /i and all bounded, continuous functions f,g:T,^ 
Randh-.R^R, 

(40) lim [ f{x)g{a^^''\x))h{RT{x))dii{x) = [f{x)dn{x)x [ g{x)h{t) dfi* {x,t) 

where fi* is the suspension of /x. 

The special case where / = 5 = 1 is of particular interest, because it yields estimates of 
the probability that Rt falls in an interval. In particular, it implies that there exists C < 00 
such that for all e > 0, all a > 0, and all sufficiently large T (i.e., all T > t^), 

(41) G S : a < Rt{x) <a + e} <Ce. 

We will say that two (or more) weakly convergent sequences Xt, Yt of random vari- 
ables, vectors, or sequences are asymptotically independent as T — )• 00 if the joint distribu- 
tion of {Xt,Yt) converges weakly to the product of the limit distributions of Xt and Yt, 
that is, if for all bounded, continuous real-valued functions v, 

lim Eu{Xt)v(Yt) = ( lim Eu{Xt)){ lim Ev{Yt)). 
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In this terminology, Proposition l4.2l asserts that the "overshoot" random variable Rt{x) is 
asymptotically independent of the state variables x and (t'^(^) (x). 

If X G S is chosen randomly according to an ergodic, shift-invariant probability mea- 
sure ^ then tt{x) will be random. However, when T is large the random variable is to first 
order of approximation "predictable" in that the error in the approximation tt ~ T/E^F 
is of order Op{1/Vt). More precisely: 

Proposition 4.3. Assume that F : S — )- M and g : ^ R'' are Holder continuous functions, 
with F > 0, and let be a Gibbs state for the shift (S,(t). Then there exist a constant b > 
(depending on fi and F) and a k x k positive semi-definite matrix M (depending on fi, F, and g) 
such that fls r — > oo, 

(42) '^^ ~bjf^'^ ^ Normal{0, 1) and 

(43) - TE,g/E,F ^ ^^^^^^^ ^^^ 



Moreover, the limiting covariance matrix M is strictly positive definite unless some linear com- 
bination of the coordinate functions gi is cohomologous to F + cfor some constant c. Finally, 
the random vector {Brg — TE^g/E^F)/T^/'^ and the random variable {tt — T/Efj_F)/T^/'^ are 
asymptotically independent of the overshoot Rt{x) = St-(^x^F{x) — T and the state variables x 
and a'^^^^x. 

Both ((42|) and (|43|) are elementary consequences of Ratner's [3ll central limit theorem. 
(See in particular the proof of Theorem 2.1 in ||3T| . The vector-valued central limit theorem 
follows from the scalar central limit theorem by the so-called Cramer-Wold device - see, for 
instance, [4J, ch. 1.) The last assertion (regarding asymptotic independence) can be proved 
by standard methods in renewal theory (see for instance ||36|): roughly speaking, it follows 
because the values of the random variables Rt{x) and a'^^'^^x are mainly determined by 
the last 0(1) steps before time r(x), whereas the values of {Srg - T'^E^g/{E^,Ff)/T^/'^ 
and other "bulk" random variables are mostly determined long before time t{x). 



5. U-STATISTICS 

5.1. [/—statistics with random limits of summation. Let (S, a) be a two-sided shift of fi- 
nite type and F : S — ^ (0, oo) a Holder-continuous function. Assume that F is nonarithmetic: 
this ensures that the conclusions of Propositions 14.21 and 14.31 are valid. As in section |431 
define r = tt : S — >• Z+ to be the first passage time to the level T > by the sequence 
SnF (see equation ((39)) ). Let /i:SxS— s-Mbea symmetric, Borel measurable function. 
Our interest in this section is the distribution of the random variable 

t(x) t{x) 

(44) Ut{x) = ^Y^ h{a'x, a^x), for x e E, 

i=i j=i 
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under a Gibbs state fi or, more generally, under a probability measure that is absolutely 
continuous with respect to a Gibbs state. Random variables of this form — but with 
the random time t(x) replaced by a constant n — are known in probability theory as 
U— statistics, and have a well-developed limit theory (cf. [17J, L15J)- Unfortunately, the 
standard results of this literature do not apply here, for three reasons: (a) because here the 
limits of summation in (|44)) are themselves random variables, (b) because no continuity 
requirements have been imposed on the function h, and (c) because of the peculiar nature 
of the dependence in the sequence {a"'x}n£Z- 

5.2. Convergence in law under Gibbs states. Fix a probability measure A on S. 
Hypothesis 5.1. The kernel h admits a decomposition h = Ylm=i such that 

(HO) each /i^ : S — ^ M /s fl symmetric function of its arguments; 

(HI) there exists C < oo such that Y^^yi \ hm\ < C onT, x 'E; 

(H2) hm{x, y) depends only on the coordinates Xj,yj such that \j\ < m; and 

(H3) there exist C < oo and < /3 < 1 such that for allm> 1 and j G Z, 

(45) f \hm{x,a^x)\dX{x) <CP'^. 

Definition 5.2. For any bounded, symmetric, measurable function /i : S x S — )• M and any 
Borel probability measure A on S define the Hoeffding projection : S — M of n relative 
to A by 

h+{x) = / h{x,y)dX{y). 
Jt, 

Say that the kernel h is centered relative to A if its Hoeffding projection is identically 0. 



Our interest is in the large-T limiting behavior of the random variable Ut defined by 
(more precisely, its distribution) under a Gibbs state fi = fior a probability measure A 
that is absolutely continuous with respect to a Gibbs state. Observe that if is a Gibbs state 
and if h satisfies Hypothesis 15.11 relative to then the corresponding Hoeffding projection 
is Holder continuous on S, even though h itself might not be continuous. The 
following theorem will show that under Hypothesis 15.11 two types of limit behavior are 
possible, depending on whether or not is cohomologous to a scalar multiple aF of F. 
Set 

(«) 

Theorem 5.3. Let n = fi be a Gibbs state, and let h : T, x T, —?- M. be a function that satisfies 
Hypothesis 15.31 with X = fi. If the Hoeffding projection of h relative to fi is cohomologous to 
aF for some scalar a G M, then as T —?- oo, 

(47) C/.= ^^-™^'Ao 
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for some probability distribution G on M. Otherwise, 

(48) C/.= ^"-%^,f'-^''^''AG~ 

for a proper Gaussian distribution on M. Furthermore, in either case the random vector {Ut, tt), 
the state variables x, a^^^\ and the overshoot random variable Rt are asymptotically independent 
asT ^ oo. 

Proof strategy. The logic of the proof is as follows. First, we will show that the theorem is 
true for centered kernels h such that /i(x, y) depends only on finitely many coordinates of 
the arguments x, y. This will use Proposition 14.31 Second, we will prove by an approxi- 
mation argument that the truth of the theorem for centered kernels can be deduced from 
the special case of centered kernels that depend on only finitely many coordinates. This 
step will use moment estimates that depend on Hypothesis 15 . 1 1 (in particular,on the criti- 
cal assumption (H3)). Third, we will show that to prove the theorem in the general case it 
suffices to consider the case where the kernel h is centered. For ease of exposition, we will 
present the third step before the second; however, this step will rely on the other two. 

Observe that the validity of ((47)) - ((48]l is not affected by rescaling of either T or h. Con- 
sequently, there is no loss of generality in assuming that E^j^F = 1 and that the constants 
C, C' in HypothesisEUare C = C' = 1. □ 

Step 1. Assume first that h is centered (thus, /i+ is cohomologous to aF with a = 0, and 
so case ((47)) of Theorem 15.31 applies), and that h{x,y) depends only on the coordinates 
xiX2 ■ ■ ■ Xm and yiy2, , , ym- Then the function h assumes only finitely many different val- 
ues, and these are given by a symmetric, real, square matrix h{^, (), where ^ and ( range 
over the set of all length-m words occurring in infinite sequences x G S. This ma- 
trix induces a real, Hermitian operator L on the finite-dimensional subspace of L^(E,/x) 
consisting of functions that depend only on the coordinates xiX2 ■ ■ ■ Xm- Let Dm be the 
dimension of this subspace. Because h is centered, the operator L contains the constants 
in its null space. Consequently, all other eigenfunctions (p^ are orthogonal to the constant 
function 1, and thus, in particular, have mean 0. It follows by the spectral theorem for 
symmetric matrices that the [/—statistic l|44)) can be written as 

t{x) t{x) D™, D™, / t(x) ^ ^ 

Urix) = X] X] X] ^kfkicr'x)(pkicr^x) = ^>'k ^k{(^'x) 

i=l j=l k=2 k=2 \i=l 

Therefore, Proposition l4.3l implies that as T — ^ oo, 

(49) ( TT, ( ^ ^k o I A N(0, A) 

for some possibly degenerate {Dm — 1)— dimensional multivariate normal distribution 
A^(0, ^). The convergence W7\ follows, with a = and G the distribution of the quadratic 
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form of the multivariate normal. Proposition 14.31 also implies that the random vector ((49)l 
is asymptotically independent of x,a'^^^'>x, and Rt{x); consequently, so is the random 
vector with components Ut/T and tt- □ 

Step 3. Assume that the result is true for all centered kernels. We will show that the the- 
orem then holds for any non-centered kernel satisfying Hypothesis 15.11 Recall that if h 
satisfies Hypothesis 15.11 then its Hoeffding projection /i+ is Holder continuous. There are 
two cases to consider, according to whether or not /i+ is cohomologous to a scalar multi- 
ple of F. Consider first the case where /i+ is cohomologous to aF for some a G R. Thus, 
E^hj^ = a (since E^F = 1), and so for some coboundary w — w o a, 

h{x, y) = ho{x, y) + h+{x) + h+{y) - a 

= ho{x, y) + aF{x) + aF{y) + — wi^ax) — w{y) — w{ay) — a 

where /io(a;, y) is centered. This implies that 

t{x) t{x) 

i=i j=i 

t{x) t{x) 

= ^ ^ hoia'x, a^x) + 2ar(x)S'^(^)F(x) - aT{xf + t{x){w{x) - w{a^'-''^x)) 
i=i j=i 

t{x) t{x) 

= ^Y1 ^o(f^'a;, a^x) + aT^ + a(r(x) - Tf + ar(x)i?T(x) + t{x){w{x) - w{a^^''^x)) 
i=i j=i 

=: Vt + aT^ + a(r(x) - Tf + ar(x)i?T(x) + t{x){w{x) - (ct^^^^x)) 

where Vt is the [/—statistic ((44]l with the kernel h replaced by the centered kernel ho. 
Now as r oo, t(x)/T 1 a.s., by the ergodic theorem, and both Rt and fx converge in 
distribution, by Proposition 14.21 and Proposition 14.31 Consequently, the convergence ((47)) 
and the joint asymptotic independence assertions hold because by assumption they hold 
for centered kernels. 

Next, consider the case where /i+ is not cohomologous to a scalar multiple of F; we 
must prove ((48)) . As above, we assume without loss of generality that E^F = 1. Let /iq be 
the centered kernel defined by 



h{x, y) = ho{x, y) + h+{x) + h+{y) - b 
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where b = E^h+; then 

t{x) t{x) 

Ut = ^^ho{a'x,a^x) + 2r(a;)5^(^)/i+(x) - bT{xf 
i=i j=i 

t(x) t{x) 

= J2Y1 ^o(tT*^, (T^x) + 2T{x){Sr(x)h+{x) - bT{x)) + bT^ + b{T{xf - T^). 

1=1 j=l 

Now consider the effect of dividing this quantity by T^/^. Since the kernel /iq is centered, 
the double sum divided by T converges in distribution (by our hypothesis that the the- 
orem is true for centered kernels); hence, if it is divided by T^/^ it will converge to 0. 
Thus, asymptotically as T — > oo the distribution of {Ux-bT^)/^^^'^ is determined by the 
remaining terms 2T{Sr — br) jT^I"^ and 6(r^ — T'^)/T'^/'^. The ergodic theorem implies that 
t/T 1, and the central limit theorem (Proposition 1431) implies that {Srh+ - bT)/T^I'^ 
and TT = (r — T)/T^/'^ converge jointly in distribution to a two-dimensional Gaussian 
distribution; consequently, 

2T{x){S,(.,)h+{x)-bT{x)) + b{T{xf-T^) ^ . 
— TTfT^ > Gaussian. 

This proves (|48)) . The asymptotic independence assertions follow directly from Proposi- 
tionltsl □ 

Step!. Assume, finally, that h is a centered kernel which satisfies Hypothesis 15.11 With- 
out loss of generality, we can assume that the functions hm in the decomposition h = 
X]m=i themselves centered, because replacing each hm{x, y) by hm{x, y) — — 

h^{y) does not change the validity of Hypothesis l5.1[ Set 

m oo 

Vm = '^hk and Wm = ^ /ifc- 

fc=l k=m,+l 

Then each Vm is centered, so by Step 1, the result is true if Vm is substituted for h in 
the definition of Ut (but of course the limit distribution G = Gm will depend on m). 
Consequently, to prove the result for h it suffices to show that for any e > there exists m 
sufficiently large that 

t(x) t(x) 



(50) ^l{x : 



y^^y^^wm{(y'x,a^x) 

i=l j=l 



> eT} <£ 



for all sufficiently large T. 
FixO < 5 < 1/6, and set 



_{T) = + 
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By the central limit theorem (Proposition l4.3|l , n_ < r < ri_|_ with /i— probability approach- 
ing 1 as T — )• oo; thus t{x) is essentially limited to one of T^/'^+^ different possible values. 
Therefore, by the Chebyshev inequality and a crude union bound, to establish ((50)l it suf- 
fices to prove the following. 

Lemma 5.4. For each e > there exists m suffices large that for all large T, 

^^w^n,{a'x,a^x)\ <eT'^ and 
i=i j=i J 

(n n n- n- \ ^ 

i=i j=i i=i i=i / 



+ 



□ 

Proof of ((5T|) . This will use Hypothesis (H3) and also the fact that Gibbs states have expo- 
nentially decaying correlations (equation ((34)) ). Since y) depends only on the coordi- 
nates Xj, yj with \i\ < k, and since \ hk\ < 1 (see the earlier remark on scaling), exponential 
correlation decay implies that for all A;, r > 1, 

(53) \Eihk{x, a'^+'-x) I i3(_oo,fc]u[2fc+2r,oc))l < C(3r 

For convenience, we shall assume that the constants < /3 < 1 in ((34)) and in Hypothesis 
(H3) are the same (if the two constants are different, replace the smaller with the larger). 

When the square in ((51)) is expanded the resulting terms have the form 

Ehk{a^x, x)hki{a^ x, x), 

with k,k' > m and < < T. Let A be the largest integer such that one of 

the four indices is separated from the other three by a gap of size A, and let 

A;* = max(A;, k'). Then by the exponential correlation decay inequality ((53)) and Hypothesis 
(H3) (using the fact that \ hkhki\ < \hkj), 

\Ehk{a'x,a^x)hk'{a'' x,a^' x)\ < Cmin(/3^-2^*,/3'=*). 

For any given value of A > 1, the number of quadruples i, i' ,j, j' < T with maximal gap 
size A is bounded above by 96T^(2A + 1). Furthermore, for each k^ > m the number of 
pairs k,k' > m such that max(/c, k') = k^ is less than 2k^. Consequently, 

\ ^ oo oo 

Y,Y.'^m{^'x,a^x)\ <C'T^Y1 E(2A + l)fc*min(/3^-2^*,/3'=*) 

i=l j=l J fc*=mA=0 

where C = 192C. By choosing m sufficiently large one can make this bound smaller than 

er2. □ 
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Proof of ((52)) . This is similar to the proof of ((51]) . the difference being that here it is nec- 
essary to count octuples instead of quadruples. The key once again is the exponential 
correlation decay inequality ((53)) : this implies that for any 4 triples ir,jr, kr, 

4 

\E^llhk^{a'^x,a^^x)\ < C mm{P^~^''% P''*) 

r=l 

where = maxi<r<4 fcr and A is the maximal gap separating one of the indices ir,jr 
from the remaining 7. For each r < 4 the indices ir,jr that occur in ((52)) are constrained as 
follows (taking v to be the smaller of the two): either 

1 < ir < n_ < jr ^ n or < ir < jr < n. 

Consequently, for each A > 1, the total number of octuples (v,ir)i<r<4 with maximal gap 
A that occur when the fourth power in ((52)) is expanded is bounded above by C' A^T^^"^^ 
for some constant C" < oo independent of T and A. For each A;^, > m, the number of 
quadruples ki,k2, k^, k4 with maximum value A;^, is bounded above by Ak^. Therefore, for 
each n such that ri- < n < n+. 



1=1 j=i 1=1 j=i 




for a constant C" < oo independent of T and m. By choosing m large one can make this 
bound smaller than eT^+^'^. □ 



5.3. Extensions. 

Corollary 5.5. Let n = nbe a Gibbs state, and let Xbe a Borel probability measure on E that is 
absolutely continuous with respect to fi and such that the likelihood ratio dX/dp, is continuous on 
S. Let /i : S X S — ^ M &e a function that satisfies Hypothesis 15. 1 1 relative to p. Then all of the 
conclusions of Theorem 15.31 remain valid under the measure A, and the joint limit distribution of 
Ut, tt, and Rt under A is the same under A as under p. 

Proof. This follows from the asymptotic independence assertions of Theorem 15.31 Con- 
sider first the random variable Ut'- show that it converges in distribution under A, we 
must prove that for any bounded, continuous test function : — IR/ the expectations 
E\iP{Ut) converge as T — ;> oo. But since dX/dp is a bounded, continuous function, the 
convergence ((47)l - ((48)) and the asymptotic independence of x and Ut{x) under p imply 
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that 

lim Exi^iUr) = Hm S^V(^^t)^ 

T-i-oo T^oo dfl 

= lim S^V(^^t) lim 

T^oo T->oo a// 

= lim E^tPiUr). 
T700 

(Note: This holds in both the case where /i+ is cohomologous to a scalar multiple of F and 
the case where it isn't.) A similar argument, using Proposition 14.21 and Proposition 14.31 
proves that the random variables Rt and tt converge in distribution under A to the same 
limit distributions as under fi, and that the various random variables, vectors, and se- 
quences are asymptotically independent. □ 

This result will suffice to deduce limit results about continuous-time [/—statistics in 
suspension flows under suspensions of Gibbs states (cf. section |43)) from corresponding 
results about discrete-time [/—statistics in shifts of finite type. For dealing with measures 
like the uniform distribution on the set of periodic orbits of minimal period < T the fol- 
lowing variant of Corollary 15.51 will be needed. 

Corollary 5.6. Let fi = fi be a Gibbs state and h : x T, ^ R be a function that satisfies 
Hypothesis 15. 1 1 relative to ^i. Let {At}t>i a family of probability measures on S such that as 
T — )• 00, 

(54) ^(x) ~ gi{x)g2{a^^''^x)g3{RT{x)) 

where 51,52 : 5] — > [0,oo) and 53 : [0,oo) are nonnegative, bounded, continuous functions 
not depending on T, such that 53 is strictly positive on an interval and §2,93 both have positive 
expectation under E^. Then as T — > 00 the joint distribution ofUr, t't, Rt, x, and a'^^^^x under 
At converges. Moreover, the limiting joint distribution ofUr and fx is the same as under p,. 

Proof. The proof is virtually the same as that of Corollary |5.5l in particular, for any bounded, 
continuous test fimctions ipi,'ip2,''p3 : K — )■ M and ^4 , -(/^s : S — M, 

(55) lim Ex^MRT)MUT)MrT)Mi^5oa^) 

T— )-oo 

= lim £;;,V'i(^t)V'2(C^t)V'3(7't)V'4(V'5 ocr^)5i(52 o (^^)93{Rt{x)) 

by Theorem [531 since gi, §2, and 53 are continuous. Moreover, because the random vari- 
ables x,a'^^^\x), and Rt are asymptotically independent of {Ut,t't) under p, they will 
also be asymptotically independent under Xt, and the limit distribution of {Ut, tt) will 
be the same as under state p. However, the limit distributions of Rt and will in 

general be different, because unlike the bulk variables {Ut,tt) t hese random variables 
are highly dependent on the last few coordinates of x before r(x). In particular, the limit 
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distribution of a'^^^\x) will be "tilted" by the likelihood ratio §2- 

lim Ex{ip o (T^) = lim E^^Tp o a'^){g2 o a^) = E^^g2. 

□ 

Remark 5.7. The corollary remains true if it is only assumed that is piecewise continuous, 
in particular, if g^ is the indicator function of a bounded interval [a,b]. This can be proved 
by a routine sandwiching argument, using the fact that : Rt{x) G (6 — e, 6 + e)} = 
0{e), by the renewal theorem (Propositior l4.2|) . Observe that for this the standing assump- 
tion that F is nonarithmetic is essential. 



6. Verification of Hypothesis 15. II 

6.1. Main Result. To deduce Theorem 1 1 . 1 1 from the results of section|5]it will be necessary 
to show that the relevant fimction /i : S x S — ^ M in ((26l) satisfies Hypothesis 15. II For the 
functions h and defined in section |3l the properties (HO)- (H2) hold trivially, so only 
statement (H3) of Hypothesis lS.ll warrants consideration. The purpose of this section is to 
prove that for any Gibbs state ^ there exist values of r such that the function K'' defined in 
Remark meets the requirement (H3). 

By Proposition 13.21 and Corollary 13.41 for any compact, negatively curved surface T 
the geodesic flow on ST is semi-conjugate to a suspension flow {T;F:(t>t) over a topo- 
logically mixing shift of finite type (S,(7) with a Holder continuous height function F. 
This semi-conjugacy is one-to-one except on a set of first category, and both the Liou- 
ville measure and the maximum entropy measure for the geodesic flow pull back to sus- 
pensions of Gibbs states on S. Furthermore, points x G S of the underlying shift are 
mapped to pairs of points on in such a way that the suspension-flow 

orbit through (x, 0) is mapped to the geodesic whose L— lift to the Poincare plane has 
endpoints and this mapping sends cylinder sets to arcs of satisfying 

(D)-(E) of Proposition 13.21 By Lemma 13.51 for any small e > the symbolic dynamics 
admits a refinement for which the height function F satisfies F < e. By Proposition 13.71 
the self -intersection counts for closed geodesies and geodesic segments are given by equa- 
tions ((26l) and (|29)l , with h = W for any < r < min F. The function K'' decomposes as in 
equation (|32]|. 

Proposition 6.1. For any Gibbs state fi, the functions satisfy (H3) of Hypothesis 15. 1 1 relative 
to fifor almost every r in some interval [0, r*] of positive length r^,. 

The remainder of this section is devoted to the proof of this proposition. The key is the 
property (A3)' (cf. Remark |3!8)) , which asserts that there exists q < 1 such that /ij^ / 
for some n > m only if the geodesic segments corresponding to the suspension flow 
segments T'!^ and (cf. equation ((3T|) ) intersect either at an angle less than C'g"^, or at a 
point within distance C'g^ of one of the endpoints of the two geodesic segments. 
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For any integers m,k > 1 define 

A'!^ )^ = {xell ■.po7r[T^) and P°T^{^l.k^) intersect at angle < and 

B^ f^ = {xell :po7r(J^^) and P°7r(-F^fc^) intersect at distance < ^™ of 

p{it{x, 0)) or p{Tr{x, F{x) — r))}. 

To show that (H3) of Hypothesis 15. 1 I holds relative to a Gibbs state fi it is enough to show 
that there exist C < oo and /3 < 1 such that for all sufficiently large m and all k 0, 

(56) M^;,,,) + Mi?Lfc)<C/3™- 

We will show in Lemmas 16.41 and 16^1 that each of the probabilities /u(^^ ^) and fJ.{B^ y.) is 
exponentially decaying in m, uniformly in k, for almost every r in a small interval [0, r^] 
of positive length. 

6.2. Intersections in small balls. We begin with fJ-{B^ f,). The strategy for bounding this 
will be to first handle the case \k\ < expjem} for small e > by a density argument, and 
then the case \ k\ > exp{em} by using the exponential mixing property (|35)l of Gibbs states. 

Lemma 6.2. IfO < g < a < 1, then for any Gibbs state ^ and almost every r < minF/3, ifm is 
sufficiently large then 

(57) /.(S;,,,) < a- for all \k\ < {aj eT'^ k / 0. 

Vroof. Without loss of generality we can assume (cf . Lemma I3.5D that no two geodesic 
segments of length less than 2 max F intersect transversally more than once. For x G S let 
Bm,k{x) be the set of r G [0, F{x) + F{ax)] such that the geodesic segments po vr (7a; U J^^x) 
and po IT [T^k^UJ^^k+i^) intersect at distance less than of po 7r(x, r). Because there is at 
most one intersection, the Lebesgue measure of Bm,k{x) is less than 2g"^. Since x G BJ^ ^ 
implies that r G Bm,k{x), it follows by Fubini's theorem that for any a G {g, 1), 

niLebir G [0,minF/3] : fiiB^^^^) > a'"} < 2{g/ar =^ 

rriLebir G [0,minF/3] : n{B'^^ ^) > a" for some 

Since X],;„(f?/a)™^^ < oo, it follows by the Borel-Cantelli lemma that for almost every 
r G [0, min F/3] the inequality (|57|l holds for all sufficiently large m. □ 

Lemma 6.3. For any Gibbs state fj, on and any T < oo, there exist 5 = 6{fj,, T) > and 
C = CT,fi > with the following property: for any ball B in T of sufficiently small diameter 
e > 0, 

(58) n{x G S : po 7r({0f(x,O)}o<t<T) intersects B} < Ce^. 

Proof. In the special case where the suspension /i* of the Gibbs state ^ (see equation (|38ll) 
is the puUback of the Liouville measure this is apparent from purely geometric consider- 
ations, as we now show. We may assume that for e sufficiently small the image p o tt{Fx) 
of a fiber can intersect a ball of radius 2e at most once. Since the surface area measure of 
a ball B{x, e) of radius e is x e^, the ergodic theorem implies that the long-run fraction of 
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time spent in B{x, 2e) is almost surely Ke^, for a constant K independent of e. On each 
visit to B{x,e) a geodesic must spend time at least K'e mB{x, 2e). Consequently, the long 
run fraction of the sequence of fibers po 7r(J>n^) on a geodesic that visit B{x, e) is less than 
Ke/K'} thus, by the ergodic theorem, ((58)) holds with 6 = 1. 

Unfortunately, for Gibbs states in general there is no simple relation between the surface 
area and the Gibbs measure, so a different argument is needed. Consider first the case of 
a Riemannian metric with constant curvature —1. Recall that in this case the surface T can 
be identified with a compact polygon V in the Poincare disk D whose edges are pasted 
together in pairs. With this identification, any ball i? in T corresponds to a ball of the same 
radius in the interior of V, provided this ball does not intersect dV, or otherwise a union 
of at most Ag sectors of balls of the same radius, where 4g is the number of sides of V. 
Thus, a geodesic segment of length less than min F that intersects B will lift to a geodesic 
segment in B that intersects one of up to Ag balls of the same radius, all with centers in the 
closure of V. Consequently, a geodesic segment of length T in T that intersects a ball of 
radius e in T lifts to a geodesic segment in D that intersects one of up to 4gT balls of the 
same radius, all with centers at distance no more than T from V. 

Fix a point G dD on the circle at infinity, and consider the set of all geodesies in 
B with C_ as an endpoint (as t — — oo) that intersect a ball B of radius e with center in 
V U dV. For any such geodesic, the second endpoint C+ on 9B is constrained to lie in an 
arc J(C-5 B) of length < Ke, where K is a constant that does not depend on C_ or on the 
center of B. Recall (Proposition l3.2|l that specification of the endpoint C- of a geodesic is 
equivalent (except on a set of first category) to specification of the backward coordinates 
x^ of the sequence x G S that represents the geodesic; and similarly, specification of the 
endpoint C+ is equivalent to specification of the forward coordinates By (D)-(E) of 
Proposition 13.21 it follows that constraining C+ to lie in an arc of length < Ke has the 
effect of constraining its forward itinerary x+ to lie in a union of one or two cylinder 
sets i;j[)^j(y) with m = K'loge^^. Now for any Gibbs state /x there exists /3 < 1 such 
that the /x— measure of any cylinder set S[i^](x) is less than (3"^. Moreover, by inequality 
((34)) , the conditional measure /x(-|x~) given the past is dominated by a constant multiple 
of the unconditional measure fi. Thus, if G'^{B) denotes the set of all x G S such that 
the suspension flow segment (a;, 0)}o<t<T lifts to a geodesic segment that intersects B, 
then 

This implies ((58)) in the constant curvature case. 

This argument extends to metrics of variable negative curvature, with the aid of the 
structural stability results of section 13.21 Let be a metric of variable negative curva- 
ture and qq a metric of curvature -1. Recall that the ^^i— geodesic flow is orbit-equivalent, 
by a Holder continuous mapping <I> : ST — )• 5T, to the ^^ogeodesic flow, and that the 
homeomorphism <I> lifts to a homeomorphism $ : 5B — )■ ^B of the universal cover. Each 
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£)o— geodesic in D corresponds under ^ to a gi — geodesic, and these have the same end- 
points on 9D and the same symbolic representation x G S. Because ^> is Holder, con- 
straining a geodesic to pass through a ball of radius e forces the corresponding 
£>o— geodesic to pass through a qq— ball of radius e", for some q > depending on the 
Holder exponent and all e sufficiently small. Therefore, the problem reduces to the con- 
stant curvature case. □ 

Lemma 6.4. If g < 1 then for any Gibbs state fi and for almost every < r < minF/3, there 
exists a <1 such that ifm is sufficiently large then 

(59) fiiB:^^,)<a"' forallk^O. 

Proof Lemma l6!2l implies that if g < a < 1 then for almost every r < min F/3 the inequal- 
ity (|59]l holds for all \k\ < [aj gf^l"^ . We will show that for every < r < minF/3 the 
inequality ((59]l also holds for > (a/^^)™/^; for this we shall appeal to the exponential 
mixing inequality ((34)) . using Lemma 1631 to control the first moment. The proof will rely 
on the following elementary geometric fact: For any compact Riemannian manifold Ai there 
exists K < oo (depending on the metric) such that for every sufficiently small e > there is a 
finite set of points zi, Z2, . . . , Zn such that every x e M is within distance e of some Zi, but is 
within distance 6e of at most k distinct points Zi. Call such a collection of points Zi an efficient 
e— packing. 

In order that x G ^ it is necessary that the geodesic segment p o 7r{J^^f,^) intersects 
either the ball of radius g"^ centered at p o 7r((/),.(x)), or the ball of radius g"^ centered at 
p{(t>F(x)-r{x))' oi" both. Let zi, .., z„ be an efficient packing and let B{zi,3g'^) be the 
ball of radius 3g"^ centered at Zj. Then 

n 

Fi ' c~ I I n 

i=l 

where is the set of all x G S such that p o 7r(x, r) G B{zi, Sg"^) and Gi^rn,k is the set of 
all X G S such that the geodesic se^m.en\.poTT{T^kr^yjF„k+ir^) intersects B{zi, 3^3™). (This is 
because Ty C TyU T^y.) Since zi, .., 2;„ is an efficient g™— packing, at most k of the events 
Hl,^ can occur together; consequently, 

n 

i=l 
n 

1=1 

< Km.aXfi{Gi^rn,k I Hi^). 
i<n 

Thus, it remains to bound the conditional probabilities fJ-{Gi^m,k \ -f^fm) 1^1 > {olI qY^^"^ ■ 
For each i let < ^ 1 be a smooth function on T with Lipschitz norm less than 
6^3""^ that takes the value 1 on B(zi,3g^) and on the complement of B{zi,6g"^). For 
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each X G S and < r < minF/3 define 

gi,m{x) = max '4)i{po'K{x,s)) and 

0<s<F{x)+F{ax) 

hlmi^) =^i{poTT{x,r)). 

Since the projection p o vr is 5— Holder continuous for some exponent 5, both gi^rn arid 
m have (5— Holder norms bounded by 6||p o vrH^^j^™. Therefore, the exponential mixing 
inequality (|35)l implies that for some C < oo and < /3 < 1 independent of i,m,k and r. 

For I A;| > {a/ q)^/"^ the second term is super-exponentially decaying in m. But Lemma 1631 
implies that the expectation E^gi^m is bounded by {Qq^Y for some q> Q, and so the result 
now follows. 

n 

6.3. Intersections at small angles. Next we must show that the events have uni- 
formly exponentially decaying probabilities, in the sense (|56)). The strategy here will be 
to show that if two geodesic segments corresponding to distinct fibers of the suspension 
flow cross at a small angle, then it will be impossible for their successors to cross for a 
long time. This fact, coupled with the ergodic theorem, will imply that the probability of 
a crossing at a small angle must be small. The key geometric fact is as follows. 

Lemma 6.5. For any k > sufficiently small and any q <1 there exists C < oo such that for all 
large m > 1 the following holds. If two geodesic segments 7([0, 2k], x) and 7([0, 2k], y) of length 
2k cross transversally at an angle less than g"^ then for every 1 < j < Cm the geodesic segments 
'y{[jn,jK + 2k,], x) and j{[jK,jK + 2k], y) do not cross. 

Proof. Let k > be sufficiently small that no two geodesic segments of length 3k on T can 
cross transversally more than once. Consider lifts ^(t, x) and 7(t, y) of the geodesic rays 
j{t, x) and j{t, y) to the universal covering surface T whose initial segments 7([0, 2k], x) 
and 7([0, 2k], y) cross transversally at angle < g™. These geodesic rays cannot cross again, 
because for any two points in a Cartan-Hadamard manifold there is only one cormect- 
tng geodesic. Consequently, if for some j the geodesic segments 7([jK, jk + 2k], x) and 
^{[jK,jK + 2k],?/) were to cross, then their lifts j{[jK,jK + 2k], x) and 7([jK,jK + 2k], y) 
would contain points w, z, respectively, such that z = gw for some element 5 / 1 of the 
group of deck transformations. However, if the initial angle of intersection is less than 
Q-m |.]-,gj-^ ^\yQ geodesic rays 7(t, x) and ^{t, y) cannot diverge by more than e for time Cm, 
where C is a constant determined by e and the curvature of T (which is bounded, since 
T is compact). If e > and k > are sufficiently small then this would preclude the 
existence of points w, z such that w = gz for some 5 / 1. □ 

Because the semi-conjugacy vr : Si? — ^ ST is not one-to-one, two orbits of the geodesic 
flow can remain close for a long time but have symbolic representations that are not close. 
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The next lemma shows that, at least for the symbolic dynamics constructed by Series (cf . 
section |3!2)) and refinements such as that described in the proof of Lemma |331 this event 
has small probability under any Gibbs state. 

Fix a > and e > 0, and for each m > 1 let Dm = Dm'^ be the set of all sequences x G S 
such that there exists (y, s) G satisfying 

distance(7r((/»t(x,O)),7r(0t(y,s))) < e for all \t\ < e"™ and 
Xi 7^ Hi for some |i| < m. 

Lemma 6.6. Let be any Gibbs state. Then for all sufficiently small e > and all sufficiently 
large a there exist (3 < 1 and C < oo such that 

(60) fi{Dm) < CP"" for all m > 1. 

Proof. Recall (Proposition |33] and following) that the geodesic flow with respect to a Rie- 
mannian metric of variable negative curvature is orbit-equivalent to the geodesic flow on 
the same surface but with a Riemannian metric of constant curvature — 1, and that the or- 
bit equivalence is given by a Holder-continuous mapping ST — ^T. Therefore, it suffices 
to prove the lemma for the geodesic flow on a surface of constant curvature —1. (The con- 
formal deformation of metric might change the values of (3 and e, but this is irrelevant.) 

Suppose, then, that the Riemannian metric has curvature —1, and that vr o (j)t{x, 0) and 
vr o (pt{y, s) are two geodesies on the unit tangent bundle that stay within distance e for 
all |t| < e"™, for some small e and large a. Because distinct orbits of the geodesic flow 
separate exponentially fast (at exponential rate 1, since the curvature is —1), the initial 
vectors 7r(x, 0) and 7r(y, s) must be within distance Ke""™, for some constant k = K{e) > 
independent of a and m (provided m is sufficiently large). 

Recall that geodesies can be lifted to SH via the mapping L described in section lT2l This 
mapping has discontinuities only at vectors tangent to one of the sides of the fundamental 
polygon V, but everywhere else is smooth; consequently, either 

(a) L o 7r(x, 0) and L o 7r(y, s) are within distance Cne~'^^, or 

(b) L o 7r(x, 0) is within distance Ckc''^^ of a vector tangent to one of the sides of V. 

In case (a), the lifted geodesies must have endpoints on dIS> that are within distance CKe""*"; 
in case (b) the lifted geodesies must have endpoints within distance Ckc^"™ of the end- 
points on 9ID of one of the geodesies that bound V (recall that the sides of V are geodesic 
arcs). In either ease, if x and y disagree in some coordinate |i| < m then by Proposition l3.2l 
at least one of the endpoints of the geodesic Lovro^j (x, 0) must be within distance C'/te""™ 
of one of the endpoints of an arc Jk{z^) of some generation k < m. (Recall that the arcs 
Jfc(z+) correspond to cylinder sets Sp^j(z+).) There are at most e^"^™ such endpoints, 
where A is the number of sides of V. 

If C is one of the endpoints of an arc Jkiz^) of generation k, then C has two symbolic 
expansions (i.e., there are two sequences z^,zt that are mapped to ( by 0- Since the 
arcs Jfc(-) do not shrink faster than exponentially (Proposition 13.21 part (D)), the forward 
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endpoint ^(2;+) of the geodesic L o n o (ptix, 0) will lie within distance C'kc ""^ of 
only if either 

Xi = Zi V < i < C"am or Xi = {z^)i y < i < C"am, 

for a suitable constant C" > not depending on m or a. Hence, since a Gibbs states 
will attach mass at most e^^™ to cylinder sets of generation m, for some b = b{fi) > 0, it 
follows that 

Ai(-Dm) < C'" exp{—bam} exp{2Am} . 
By choosing a > such that ba > 2A we can arrange that (|60)) holds. □ 

Lemma 6.7. for Any G/fobs sfflfe fx there exists P = /3{g) < 1 such that for all sufficiently large 
m and all / 0, 

Proof. By Lemma 16.51 it suffices to show that there exist a > 0, e > and /? < 1 such that 
for all large m, 

Suppose that x e ^ \ Dm^; then the geodesic segments p o 7r(J\JJ) and p o vr(J^)^j._^ cross 
at angle less than g^; in particular, there exist r < si < F{x) + F{ax) and r < S2 < 
F{a^x) + F{a^^^x) such that p o p(x, si) = p o 7r(cr^, 52)- Consequently, for some a > 
depending on the curvature of the underlying Riemannian metric, 

distance(7^((/.i(x,Sl)),7r((/>^(^J^S2))) < e for all |t| < e"'". 

Since x -Dm^, it follows that Xi = Xj+fc for all |i| < m. The lemma now follows from 
Lemma |4.1[ 

□ 

7. Proof of TheoremI1.1I 

In this section we deduce Theorem 11.11 from the results of section |5l using the symbolic 
dynamics for the geodesic flow outlined in section |3l For this symbolic dynamics, the 
normalized Liouville measure vl pulls back to a measure ji* on the suspension space S 
that is the suspension (cf. equation (|38)) ) of a Gibbs state /x = on S. Proposition 16.11 
implies that for any Gibbs state ji there exist values of r such that the functions and /ij^ 
in equation ((32)) satisfy Hypothesis 15 . 1 1 for A = /x^, and therefore also for any probability 
measure A on S that is absolutely continuous with respect to /i^. Recall from Remark |3^ 
that replacing the functions h, h„i by /i*", /ij^ is equivalent to moving the Poincare section of 
the suspension flow. For notational ease, we shall assume henceforth that the cross section 
has been adjusted in such a way that (H3) holds for r = 0, and drop the superscript from 
the functions h,hm- 

Throughout this section, we will let A be the projection to S of the suspension measure 
//*, that is, the absolutely continuous probability measure defined by 

(61) X{A) = E^^ilAF)/E,^F 
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Lemma 7.1. The Hoeffding projection h+ of the function h relative to the measure A is a scalar 
multiple of F, in particular, 

(62) h+ = kF 
where k = l/(47r|T|). 

Proof For any x € S, the value /i+(a;) is the probability that the geodesic segments p o 
7r{Tx) arid p o 7r{Fy) of the suspension flow will intersect when y is randomly chosen 
according to the law A. (By Lemma 1331 we can assume that the symbolic dynamics has 
been refined so that any two such segments can intersect at most once.) Since A is the 
projection to S of the puUback p.* of the Liouville measure, it follows by Lemma f2J2\ that 
the probability in question is h+{x) = kF{x). (To see this, partition each of the geodesic 
segments into sub-intervals of length 6, apply Lemma l2!2l and let (5 — )■ 0.) □ 

Lemma 17.11 implies that case ((47)) of Theorem 15.31 obtains. Theorem 11.11 would now 
follow immediately if not for the presence of the "boundary terms"sum Yliido + ffi) in 
l|29ll , since this is of order 0{T). The following lemma will show that this sum, normalized 
by T, converges in distribution as T — )• oo, and that the limits depend only on the initial 
and final points of the flow segment. 

Lemma 7.2. For X— almost every x G S , every < s < F{x), and every < r < F{a'^^x) 

(63) lim T~^''^^gQ{s,x,a^x) = SK and 

i=l 

TT 

(64) lim T~'^y^ gi {SrT+iF{x) -r,x, a'x) = rn, 

i=0 

where k = Ky = l/(47r|T|). 

Proof. The relation (|63)l follow from the results of section |2] and the ergodic theorem. The 
sum ^[=1 5'o('S, 2;, (T*x) counts the number of intersections of the geodesic segment p o 
7r{{(f>t{x, s)})_s<j<o with the union of the segments p o p{F^3x) for < j < tt{x); this sum 
can be re-expressed in terms of the intersection kernel (cf. section |2]), yielding 

TT [s/5\ [T/5\ 

y2go{s,x,a'x) = 1™ V V Hsij{-s + iS),j{jS)) + 0{1). 

i=l i=l j=l 

(The error term accounts for the possibility of an intersection with the geodesic segment 
corresponding to the final partial fiber, and therefore is either or 1.) For each fixed point 
7(— s + i6), the ergodic theorem and Lemma f2J2\ imply that 

[T/S] 

lim V Hsm-s + i5),j{j6)) = 5\ 

T-)-oo ^ — ' 
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almost surely. Letting 6^0 one obtains the first limit in ((63)) . The second limit is obtained 
in a similar fashion. □ 

Proof of Theorem 11.31 The measure A is the projection of the suspension measure fi*^, which 
in turn is the puUback to the suspension space Si? of the Liouville measure on ST. Thus, 
if {x,s) e Si? is randomly chosen with distribution jj,*^ then x has distribution A and 
the normalized vertical coordinate s/F{x) is uniformly distributed on the unit interval 
[0, 1], and is independent of x. Hence, Lemma [7.21 implies that if (x, s) has distribution 
/Lt^ then the first normalized boundary sum ((63)) will converge to YF{x)k,, where Y is 
a uniform —[0, 1] random variable independent of x. Since the double sum in 
the representation ((26)) depends only on x, it follows from Corollary 15.51 that the random 
variables 

Y:iY:oh{a^x,a^x)-E,^^h+ Egi go(^, ^, ^^^) 

T T 
are asymptotically independent as T — oo. Similarly, by Corollary 15.51 and the renewal 
theorem, if (x, s) has distribution /i^ then the overshoot Rt and the terminal state cr^x are 
asymptotically independent of x, Y and of the double sum and so by Lemma [7!2l 

the second boundary sum 

TT 

T^^ ^ gi {SrT+iF{x) - t + s,x, a'x) 

is asymptotically independent of the other two sums in ((29)) . Theorem ll.ll now follows, as 
Lemma [7!2] implies that the normalized boundary-term sum converges almost surely and 
Corollary |53] implies that the normalized sum YliYli converges in distribution. □ 



8. Proof of TheoremI1.2I 

8.1. Local self-intersection counts. Recall that for any smooth function : T — )• R_|_ the 

if— localized self-intersection counts of a geodesic 7 are defined by 

N{T) 

N^{T) = N^{T;j) = Y^^ix^) 

i=l 

where N{T) = N{T; 7) is the number of self-intersections of the geodesic segment 7[0, T] 
and Xj are the locations of the self-intersections on T. Like the global self-intersection 
counts, these can be expressed as sums of suitable functions defined on a shift of finite 
type. Let (Si?, (/f^) and (S, a) be the suspension flow and shift of finite type, respectively, 
provided by Proposition l3.7[ Define a function /i<^ : S x S — )■ R_|_ by setting 

(65) h^{x, y) = ip{z{x, y))h{x, y) 

where /i = 1 if the geodesic segments corresponding to the suspension flow segments Fx 
and Fy intersect at a point z = z{x, y) G T, and /i = if these segments do not intersect. 
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By the same reasoning as in equation (|29)) . 

(66) %{T; 7) = ^ £ E ^^(^'^' ^'^) + OiT). 

1=1 j=i 

The error term accounts for intersections with the geodesic segments corresponding to 
the first and last partial fibers (cf. equation ((29)l ), of which there are at most 0{T). Because 
the normalization in Theorem [L2](cf. relation @) entails division by T^/^, the error term 
in (|66l l can be ignored. 

The proof of Theorem ll.2[ like that of Theorem ll.ll in section[7l will rely on Corollary l5.5l 
Once again, let /i* be the puUback of the Liouville measure to Sp; recall that this is the sus- 
pension of a Gibbs state ^ for the shift. Let A be the projection of ^* to S, as defined by 
l|6T]l . This is absolutely continuous with respect to ^, so by Corollary 15.51 the conclusions 
of Theorem 15.31 remain valid for A. We must show (1) that the function h^p satisfies Hy- 
pothesis l5.1l with respect to A, and (2) that it is the second case of Theorem 15. 31 that applies 
when the support of / has small diameter, that is, that the Hoeffding projection 

(67) h+{x) := [ h^{x,y)dX{y) 

is not cohomologous to a scalar multiple of F. The first of these tasks will be carried out 
in section lS^ by an argument similar to that carried out in section|6]above for the function 
h. The second will be addressed in sections I82VI8.3I 

8.2. Representation of the Hoeffding projection. For each small 6 > define a function 
H'^ : ST X ST by setting H'g{u, v) = (p{z{u, v)) if the geodesic segments of length 6 based 
at u and v intersect at a point z{u,v) G T, and setting Hf{u,v) = otherwise. This 
is the obvious analogue of the intersection kernel Hs defined in section |2l The primary 
difference between the self-intersection kernel Hs and the localized kernel Hf is that the 
constant function 1 is not, in general, an eigenf unction of Hf. To see this, define 

W = T^/ Hf{u,v)Lidv) 

where L is the normalized Liouville measure on ST. 
Lemma 8.1. If f : T ^M.is continuous then 

lim \\kf - opiloo = 0. 

Proof. Because is continuous and ^^^{u, v) is nonzero only for pairs u, v at distance < 6, 
the value of ip{z{u, v)) will be close to ^p{pu) when 5 > is small, uniformly for u G ST. 
Hence, 

\ip{pu)Hs{u.,v) — K,5'^H'^{u.,v)\ < max \ip{pu) — Lp{pv)\. 

d(u,v)<S 

Since the constant function 1 is an eigenfunction of Hs, with eigenvalue (by Lemma lZ2)) , 
the result follows. □ 



SELF-INTERSECTIONS OF CLOSED GEODESICS 



43 



The relevance of the kernel iJj is that the Hoeffding projection /i+ defined by ((67|) can 
be expressed approximately in terms of k'^. Both /i+ and kg are defined as expectations 
of ip— values at intersection points of geodesic segments: (i) is the expected value of 

/ at the intersection point (if there is one) of the geodesic segments corresponding to the 
fibers Tx and of the suspension flow when y is chosen according to the distribution A; 
and (ii) kf is the corresponding expectation for the geodesic segments of fixed length 6. 
Hence, for small 6 the value of h^{x) can be obtained approximately by integrating along 
the fiber Tx- Together with Lemma this implies that if 7(t) is the orbit of the geodesic 
flow corresponding to the orbit (ptix, 0) of the suspension flow then 



8.3. Coboundaries of the geodesic flow. If /, 5 : S — R are Holder continuous functions, 
then a necessary and sufficient condition for / and g to be cohomologous is that they sum 
to the same values on all periodic sequences (cf.[10J, Theorem 1.28 for the sufficiency). In 
particular, for every periodic sequence x £ H with period (say) n = n{x), 



In the case of interest, the relevant functions are integrals over fibers of the suspension 
space Tip- For the function F this is obvious: 



while for the Hoeffding projection h'^ it follows from formula l|68ll . Consequently, for F 
and a/i*^ to be cohomologous it is necessary that the function aip o p — 1 integrate to 
on every periodic orbit of the suspension flow. Since both ip o p and the constant 1 are 
puUbacks of smooth functions on T, this implies that a(p — I must integrate to on every 
closed geodesic. 

Call a continuous function i/j : T ^ a coboundary for the geodesic flow if it integrates 
to zero along every closed geodesic, and say that two functions are cohomologous if they 
differ by a coboundary. It is quite easy to construct a function (7 : T — )• M that is not 
cohomologous to a constant. Take two closed geodesies a and /3 that do not intersect 
on T, and let 5 : T — M be any C°°, nonnegative function that is identically 1 along a 
but vanishes in a neighborhood of /3; then by the criterion established above, g cannot be 
cohomologous to a constant. In fact, the existence of non-intersecting closed geodesies 
yields the existence of a large class of functions that are not cohomologous to constants: 

Proposition 8.2. Let e > be the distance in T between two non-intersecting closed geodesies 
a and (3. Then no C°°, nonnegative, function g : T ^ M. that is not identically zero and whose 
support has diameter less than e is cohomologous to a constant. 



(68) 




(69) 



Snf{x) = Snipix). 



F{x) 
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Proof. By hypothesis, g vanishes on at least one of the geodesies a, /3. Because closed 
geodesies are dense in ST, their projections are dense in T. Thus, since g is not identically 
0, there is a closed geodesic on which the average value of g is positive. □ 

Remark 8.3. That there exist pairs of non-intersecting closed geodesies on any negatively 
curved surface can be proved using the conformal equivalence of Riemannian metrics 
discussed in section lT2l above. First, elementary arguments in hyperbolic geometry show 
that there are non-intersecting closed geodesies on any surface of constant curved -1. 
Next, Proposition 13.31 implies that for any Riemannian metric Iqi of variable negative 
curvature on a compact surface T there is a smooth deformation of gi to a constant- 
curvature metric qq through metrics Qs of negative curvature. In this deformation, the 
closed geodesic in a given free homotopy class deforms smoothly; moreover, transversal 
intersections can be neither created nor destroyed. Therefore, if 70, 71 are non-intersecting 
closed geodesies relative to qq, then the closed geodesies 79, 7^ in the corresponding free 
homotopy classes are also non-intersecting. 

8.4. Verification of Hypothesis l5.1[ It remains to show that the function /i^ : S x S — )• 
defined by !|65)l satisfies Hypothesis 15.11 relative to the measure A, or to some equivalent 
(mutually a.c.) probability measure. For the same reason as in section [6] (see in particular 
Lemma |6!2]| we must allow for adjustment of the Poincare section of the suspension. Thus, 
for small r > define 

Kp{x,y) = ip{zr{x,y))h''{x,y) 

where /i^ = 1 if the geodesic segments corresponding to the suspension flow segments 
and intersect at a point z = Zr{x, y) € T, and W = dii these segments do not intersect. 
Then the representation ((66)) holds with /i^ replaced by /i^, and the Hoeffding projection 
of /i^ will again be given by ((68)) . but with 

F{x) f-F{x)+r 

replaced by / 

For the verification of Hypothesis 15. 1[ we use the decomposition 

00 

(70) /i^(x,y) = ^ + V'So(^>2/) where 

m=l 

m 

'^i^j{x,y) = min{/i^(x',y') : x ■ = and y'i = yi V |i| < m}. 
i=i 

The functions ^^e obviously symmetric, since is, and V'm depends only on the co- 
ordinates \i\ < m. Moreover, each ip'^ is nonnegative, and J2m. V'm + V'L = bounded 
by II99II00. Hence, (HO), (HI), and (H2) of Hypothesis jSl] all hold, leaving only (H3). 

Lemma 8.4. There exist constants g < /3 < 1 such that for all large m, 



(71) 



ijln(.^,y) < 13 
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unless the geodesic segments p o tt^T^) and p o vr( J'p intersect at angle less than g"^ or at a point 
Zr{x, y) within distance of one of the end-points of one of the geodesic segments. 

Proof. For ease of exposition we shall discuss only the case r = 0; the general case can be 
handled in the same manner. The semi-conjugacy vr : Si? — )• ST is Holder continuous, so 
there exists a < 1 such that if two sequences x, x' G S agree in coordinates |i| < m then 
7r(x, s) and 7r(x', s) are within distance a*" for all s G [0, F{x) f\F{x')], and |F(x) — F(x')| < 
a"\ at least for sufficiently large m. 

Suppose now that Xj = x^ and = y[ for all |i| < m, and that the geodesic segments 
p o tt{Fx) and Fy intersect at an angle not smaller than and at a point z(x, y) not within 
distance g^ of one of the endpoints. Then by (A3)' of Remark [3!8l (section |3.3l) , the geodesic 
segments p o ■k{Fx') and Fyi will also intersect. Furthermore, ii a < g (as we may assume 
without loss of generality) then the intersection point z{x' , y') will lie within distance /S™ 
oi z{x,y), for some /3 < 1. Since (p is smooth, it follows that for some C < oo depending 
on the Ci —norm of 

\^{z{x,y))-^{z{x',y'))\<Cr- 

□ 

Lemma [8!4l implies that to prove condition (H3) of Hypothesis l5.1[ translation invariant 
suffices to establish the inequality ((56)l . But this has already been done, in Lemmas l6.2146.7[ 
This yields the following result. 

Proposition 8.5. For any Gibbs state p, the functions ■i/'m satisfy (H3) of Hypothesis 15. 1 1 relative 
to pfor almost every r in some interval [0, r*] of positive length r*. 

8.5. Proof of Theorem |1.2[ Proposition 18.51 implies that after appropriate modification 
of the Poincare section of the suspension flow, the function /i<^ in the representation ((66)l 
of the localized self-intersection count meets the requirements of Corollary 15.51 Proposi- 
tion 18.21 implies that if the support of : T — > M+ is less than the distance between two 
non-intersecting closed geodesies, and if 99 > is smooth and not identically 0, then the 
Hoeffding projection /li^ of /i<^ relative to the measure A is not cohomologous to a constant 
multiple of F. By the argument of section lSH it follows that the second case of Theorem l5.3l 
(cf. relation (f48|) ) applies. 

□ 

9. [/-STATISTICS AND RANDOMLY CHOSEN PERIODIC ORBITS 

9.1. An Extension of Theorem |5.3[ It is well understood that the distribution of periodic 
orbits in a hyperbolic dynamical system is, in a certain sense, governed by the invariant 
measure of maximal entropy. There are two aspects of the connection. First, according 
to Margulis' prime orbit theorem (and its generalization in l|27| ) the number of periodic 
orbits of minimal period less than T grows like e^'^ /{6T), where 9 is the entropy of the 
max-entropy measure 1.23,1 , [27,| . Second, the empirical distribution of a random chosen 
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periodic orbit (from amor\g those with minimal period less than T) is, with high prob- 
ability, close to the max-entropy measure in the weak topology on measures (see Il20| . 
Theorem 7). It is the latter connection that is primarily responsible for Theorem ll.3[ 

In this section we will formulate and prove an extension of Theorem l5.3l for [/—statistics 
of randomly chosen periodic orbits of a suspension flow. This result will be combined 
with the results of section|3]on symbolic dynamics for geodesic flows to prove Theorem ll.3l 
in section [TOl 

Fix a topologically mixing suspension flow on a suspension space S p over a shift (S, a) 
of finite type with a Holder continuous height function F > 0. The invariant probability 
measure of maximal entropy for the suspension flow is the suspension ^J*^Qp (cf . sec. l4.3|) of 
the Gibbs state fi-er, where 9 is the unique positive number such that Pressure(— 6*^) = 0. 
The value 9 is the topological entropy of the suspension flow (see, e.g., [27], [20]). 

Say that a sequence x € S represents an orbit 7 of the suspension flow if the point 
{x, 0) e Si? lies on the path 7. If x € S is a periodic sequence then all of its cyclic shifts 
represent the same periodic orbit p = px of the suspension flow, and these are the only 
representatives of p. Theorem 7 of [201 implies that for large T nearly 100% of the periodic 
orbits of the suspension flow with minimal period approximately T have the property that 
their representative periodic sequences have minimal period T/E^_gpF + o(T). Hence, 
for large T nearly all of the periodic orbits of the flow have T/E^_gpF + o{T) representa- 
tive sequences. This implies that the uniform distribution on periodic orbits of the flow 
with period ^ T is nearly identical to the image (under the natural correspondence) of 
the uniform distribution on periodic sequences x such that 57^(2.)F(x) « T. (Recall that 
r(x) = tt{x) is the smallest integer n such that SnF{x) > T, and Rt{x) = <S'^(^)F(2;) — T 
is the overshoot.) Thus, we now change our focus from periodic orbits of the flow to 
periodic sequences. 

A periodic sequence x G S represents a periodic orbit of the suspension flow with 
minimal period between T and T -|- e if and only if (a) the period of the sequence x is 
r(x), and (b) Rt{x) < e. Denote by Bt^s the set of all periodic sequences satisfying these 
conditions. This set is finite: in fact, Margulis' prime orbit theorem and the law of large 
numbers cited above (or, alternatively. Theorem 1 of f20| ) imply that 

(72) \BT,e\ ~ Ce^^(e^^ - 1) as T ^ 00 
for a constant C > independent of T and e. Define 

(73) i^T,e = uniform probability distribution on -Bt.e- 

Our objective in this section is to extend the results of Theorem 15.31 to the family of mea- 
sures i'T,e- These results concern the large-T limiting behavior of the distribution of the 
[/—statistics Ut defined by 

t{x) t{x) 

Ut{x) ■.= ^^h{a'x,a^x). 

i=i j=i 
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Since ultimately we will want to use these results to prove that the distribution of self- 
intersection counts of closed geodesies converges, it is important that they should hold 
for functions /i : S x S — )• R that are not necessarily continuous. The following relatively 
weak hypothesis on the function h is tailored to the particular case of self-intersection 
counts. For any periodic sequence x with minimal period t{x) = tt{x) and any integer 
m > 1 define 



where the maximum is over all sequences G S (not necessarily periodic) such that 

x'^ = Xi for all —m < i < t(x) + m. 

Hypothesis 9.1. For each e > there exist positive constants — > as m — ^ oo such that for 
all sufficiently large m> I and T (i.e., for T > tm,e)r 



If h is Holder continuous on S x E then Hypothesis 19.11 is trivially satisfied, because 
in this case [/ will be uniformly bounded by P^T, for some < /3 < 1. In section [7| 
we will show that the hypothesis holds for the function h in the representation (|26)) of 
self-intersection counts. 

Theorem 9.2. Assume that /i : S x S — > R satisfies Hypothesis \9.1\ and also Hypothesis 15. 1 [ for 
the measure A = fi-ep- Let be the Hoeffding projection of h relative to fi^gp (cf equation 
(|67)) ), and let IJt and fx he the renormalizations of Ut and tt defined in (|47)) , (|48)l , and (|46ll . 
(In particular, if /i+ is cohomologous to a scalar multiple aF of the height function F then Ut is 
defined by (|47)l , but otherwise it is defined by l|48)) .) Then as T ^ oo the joint distribution of Ut, 
Tt, Rt, X, and a'^'^^^\x) under VT,e converges, and the limiting joint distribution of IJt and tt is 
the same as under fi-oF (that is, by l|47)) and (|48ll ofTheorem \5.3h 

The remainder of this section is devoted to the proof of this theorem. Observe first that 
it suffices to prove the result for small values of e > 0, because for each T > and each 
integer M > 1 the measure UT^e is a convex combination of the measures i'T+ie/M,e/M- 
Since vt,£ is supported by a finite set of periodic sequences, it is mutually singular with 
respect to any Gibbs state, and so Corollary 15.61 does not apply directly to VT,e', however, it 
does apply to absolutely continuous probability measures. Therefore, our strategy will be 
to show that for small e > the uniform distribution vt,£ can be approximated weakly by 
a measure absolutely continuous relative the Gibbs state fi-er- Hypothesis l9.1l will ensure 
that the distribution of Ut under the adjusted measure is close to its distribution under 
z^T,£/ and so Theorem 19.21 will follow. 

9.2. Skeleton of the Proof. To show that ut,£ is close in a weak sense to a probability mea- 
sure absolutely continuous with respect to iJ.-eF, we will partition the support Bt^e of i^t,s 
into subsets on which the "likelihood function" of the measure fi-gp (cf. equation (|33)) ) is 



t(x) t{x) 



(74) 




i=l j=l 



(75) 



UT,e{^TU{x) > EmT} < E, 
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nearly constant. See Proposition 19.31 below for a precise statement. This will imply that 
the uniform distribution on each set A of the partition is weakly close to the normalized 
restriction of fi-op to the cylinder set of all sequences in S that agree with some element 
X G A in coordinates —m < j < r(x) + j, for some large m. 

To define the partition, for each x G S and m > 1, let BT,e,m,x be the set of all y G Bt,s 
such that 

(76) yj = Xj and yr{y)+j = Xr(x)+j for all j G [-m,m\, 

and let i^t e mx be the uniform distribution on Bx £ m X- 

Clearly, BT,e,m,x depends on x 
only by way of the coordinates 2;[_m,m]/ arid the sets BT,e,m,x are pairwise disjoint, so they 
partition Bt,s- Thus, for each m > 1, 

/rrrys \ ^ \BT,£,m,x\ 

{77) l^T^e = y ^ —TT> ^^T,e,m,x, 

^ \BT^e\ 

[ — m,m] 

where the sum is over all admissible sequences that is, sequences obtained by 

restricting sequences x G H. Theorem 1 of pO[ implies that for each x G H and m > 1, as 

T — )• oo, 

(78) \BT,e,m,x\ ^ fJ'-eF(J^[-m,m]{x))\BT,£\- 

Since ,„](x)) > for any admissible X[_m,m]/ this estimate implies that for each 

X[-m,m] the set \BT,£,m,x\ grows exponentially with T. In particular, for all sufficiently 
large T the set BT,£,rn,x is nonempty, and so i'T,e,m,x is well-defined. 

Proposition 9.3. There exist constants C = Cs,tm < oo and (3 G (0,1) such that for all 
sufficiently large m, all T > tm, and any two periodic sequences x, y G BT^efor which ((76)) holds, 



m 



(79) i-c^"< ''-'^g'--'-'^"';t''Cr:n:U '+^'a 

fl-gFm~m,T{y)+m] (?/)) e^^^-t-)^^ 

This follows, in essence, from the definition ( [33l > of a Gibbs state and the fact that F 
is Holder continuous. The formal proof, which relies on the connection between Gibbs 
states and the spectral theory of the Ruelle operator, is deferred to section below. The 
fact that Fr{—9F) = will be of crucial importance for this. 

Remark 9.4. It is not assumed in Proposition 19.31 that t(x) = T{y), so the number of 
coordinates specified in the two cylinder sets appearing in ((79)) need not be the same. 
Also, by definition of r = tt, the sums S^(^^-^F{x) and S^(^y^F{y) both lie in the interval 
[r, T + e], so the ratio of exponentials in ((79)) is bounded above and below by e^^"^. Thus, 
for small e the measures of the cylinder sets in ((79)) are nearly equal. 

For any integer m > 1 define YarmF to be the maximum difference \F{x) — F{y)\ 
for sequences x,y e S such that xj = yj for all |j| < m. Because the function F is 
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Holder continuous, the sequence Var^i^ decays exponentially in m. Consequently, if two 
sequences x, y G S satisfy Xj = yj for all —m < j < n + m then 



oo 

(80) \SnF{x) - SnF{y)\ < 5^ := 3 ^ VarfcF. 



The sequence 5m decays exponentially with m. 

Assume henceforth that e < minF. This guarantees that if T < SnF{x) < T + e then 
r(x) = n. Assume also that m is large enough that 55m < For each m > 1 and x € S, 
define AT,£,m,x to be the set of all sequences y G S that satisfy ((76)) and are such that 
< Rxiv) < £, and define XT,e,m,x to be the probability measure with support AT,e,m,x 
that is absolutely continuous relative to fi^e p with Radon-Nikodym derivative 

(81) ^^^^(z) = C7e^^-(^)/^,_(.)(z), 

where C = CT,e,m,x is the normalizing constant needed to make \T,e,m,x a probability 
measure. In section 1931 below we will show that when m and T are large the set AT,£,m,x 
nearly coincides with 

^T,e,m,x •= U ^[-m,T(y)+m](y) 

and so Proposition l9.3l will imply that the probability measure \T,e,m,x distributes its mass 
nearly uniformly across the cylinder sets in the union A'^ smx- Using this, we will prove 
that for large m the measure XT,e,m,x is close in the weak (Levy) topology to i'T,e,m,x- It will 
be most convenient to formulate this statement using the following coupling metric for the 
weak topology on the space of Borel probability measures. See ||37|| or ||16J for a proof that 
the coupling metric generates the weak topology. 

Definition 9.5. Let Qa,Qb be Borel probability measures on a complete, separable metric 
space {X,d). The coupling distance dc{QA,QB) between Qa and Qb is the infimal k > 
for which there exists a Borel probability measure Q onX x X with marginals Qa and Q b 
such that 

(82) Q{{x,y) : d{x,y) > k} < k. 

Since this definition requires a metric on X, we must specify a metric for the case X = 'E, 
so henceforth we will let d = d^, he the metric d{x, y) = 2~^^^'y^ where n{x, y) is the 
minimum nonnegative integer n such that Xj / yj for j = in. 

Proposition 9.6. There exist constants C = Cs,tm < oo and < /3 < 1 such that for all 
sufficiently large m and T > tm, and all x G S, 

(83) dc{uT,e,m,x, XT,s,m,x) < Ce^T 

Consequently, for all e > 0, all sufficiently large m and T, 

(84) dc{iyT,e,XT,e,m)<2Cr 
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where 

(85) Ar,£,m := ^ tJ'-eF{'^[-m,m]{^))'^T,e,m,x- 

X — m,m 

The proof is given in section l93] below. 

Lemma 9.7. Fix e > and m > 1 large enough that 56m < £• Then for each x G T. the con- 
clusions of Corollary \5.6\ hold for the family of probability measures {XT,e,m,x)T>i- Consequently, 
they hold also for the family {XT,e,iri}T>i- 

Proof. We must show that the Radon-Nikodym derivatives in equation (jHTj) have the form 
(|54]l . By definition, the set AT,£,m,x consists of all sequences z G S such that Rt{z) G (0, e] 
and such that (|76)) holds (with y = z). Hence, the likelihood ratio ((HT]) can be factored as 



e 



This is clearly of the form (|54]l . Although the function in this factorization is not con- 
tinuous (because of the indicator /(o,e])/ it is piecewise continuous and bounded, so by 
Remark [52] the conclusions of Corollary 15.61 are valid for the family {\T,e,m,x)T>i- Since 
each of the probability measures Xx ^ rn 

is a convex combination of the measures Xt ^ m x 
(cf. equation ((85)) ). it follows that the conclusions of Corollary 15 . 61 hold also for the family 

{XT,e,m]T>l- □ 

Proof of Theorem 19.21 Given Proposition l9 . 61 and Lemma 19^71 Theorem l9.2l follows routinely. 
Lemma W7\ implies that for all sufficiently large m the conclusions of Theorem 19.21 hold if 
the measures VT,e are replaced by XT,£^m- Thus, to complete the proof, it will suffice to 
show that for any 5 > and any continuous, boimded fimction : M'^ x — >• M, there 
exist m sufficiently large such that 

(86) \misviY>\Exj, ^ - E^,^ < 5 where ^{x) = Lp{tjT,fT,RT,x,a'^^''^ x). 

By Proposition I9.6[ for all sufficiently large m and T there exist Borel probability mea- 
sures Q = QT,e,m on with marginals ut^s and XT,s,m such that ((82)) holds with k = 
2Ce'^^/3'". Now for any probability measure Q on S x S with marginals z^TjE arid XT,e,mr 

Ex^^^^-E,^^^= [ i<Pix) -^y))dQix,y). 

By ((82)), 

|$(x) - <P{y)\ dQ{x,y) < HmWHoo, 



1 

Jd(. 



'd{x,y)>K.„ 

where Km = 2Ce^^/3'". By choosing m sufficiently large, we may arrange that k^H'I'IIoo < 
6/2. Thus, to prove l(86)l we must bound the integral of |<I>(2;) — ^{y)\ on the set of pairs 
{x, y) such that d{x, y) < Km. This is where Hypothesis 19. II will be used. 
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None of the functions Ut{x), tt{x), nor Rt{x) is continuous in x. However, if m is 
sufficiently large that 55m < £ then inequality l|80)l implies that if d{x, y) < 2~™ then 
t{x) = T{y) and {Rrix) — RT{y)\ < unless 

Rt{z) G [0, 5m] U [e — 5m] for either z = x or z = y. 

Since Q has marginals ut^e arid XT,e,m, the estimates !|72]l (for the first marginal UT,e) arid 
(ST) (for the second marginal XT,e,m, using the fact that this measure is absolutely contin- 
uous relative to fj^^ep) imply that for each e > there exist constants < oo and tm < oo 
such that if m is large enough that 55m < £ and T > tm, then 

Q{{x, y) : Rt{x) G [0, 5m] U [e - 5m]} < Ce5m and 
Q{{x,y) : Rriy) G [0,5m] U [e - 5m]} < Ce5m- 

Now if t{x) = T{y) then depending on whether or not /i+ is cohomologous to a scalar 
multiple of the height function F (cf. equations ((47)) - (|48ll ). 

t{x) t{x) 

i=i j=i 

for either a = 1 or a = 3/2. In either case. Hypothesis 19.11 ensures that there exist positive 
constants Em ^ ^ such that for all large m and T, 

Q{{x,y) : t{x) = T{y) and \Ut{x) - Uriy)] > £m} < em- 

Since the sequences Em, 5m, and Km all converge to as m — )• oo, it now follows from the 
continuity of ip that for sufficiently large m and T, 

[ Mx)-<^{y)\dQ{x,y) <5/2. 

J d{x,y)<Km 

□ 

9.3. Proof of Proposition 19.61 In this section we show how Proposition |9i6] follows from 
Proposition l9.3l The proof of Proposition l9.3l is given in section l9!4l below. 

Fix E G (0, min F) and let m be sufficiently large that 55m < £• For each x G -Br.e define 

(87) ^T,e,m,x= U 5][„m,r(j/)+m](y)- 

Since BT,e,m,x depends only on the finite subsequence X[_m^m]f the same is true of the 
set m X- "'"he estimate ((78)) implies that for large T there will be many representative 
sequences x' G BT,£,m,x for which 25m < Rt{x') < e — 4:5m) assume henceforth that x 
is such a sequence, that is, that x G -BT+25m,e-4<5m- Then by ((80)) . for any y G BT,e,m,x it 
must be the case that r(y) = r(3;); hence, the cylinder sets in the union ((87)) are pairwise 
disjoint. Therefore, there is a well-defined mapping z i-^ z from smx BT,£,m,x that 
sends each z to the element y G BT,e,m,x that indexes the cylinder set of the partition ((87)) 
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which contains z. This mapping has the property that no point z is moved a distance more 
than 2^"* (in the usual metric d = on S); in fact, 

(88) dY,{(y'z,a'z) < 2-"" for all -m<i< t{z) + m. 

For each x € 3^^3^,6-5^ define X^e mx to be the probability measure on ^ that 

is absolutely continuous with respect to ji^gp with Radon-Nikodym derivative 

r/A* 

m ^|^(^) = c^T.,.,.e--^(%.,,^,„,^(.), 

where C*{T,e,m,x) is the normalizing constant needed to make X^^^x ^ probability 
measure. The Radon-Nikodym derivative is constant on each of the cylinder sets in the 
partition (|87l ), and its values on these cylinder sets are chosen so as to cancel the exponen- 
tial factors in ((79)) . Consequently, by Proposition l9.3[ if y, z G BT,e,m{x) then for constants 
Ce < oo depending only on e, 

(90) 1 - c,r < ^^'"^'"^'""^'"^"^'-"^1^11 < 1 + 

'^T,e,Tn,xi'^l-m,T{y)+m\ (?/)) 

Corollary 9.8. Assume that x G -Bt+2<5„,e-25™- Let X^T,e,m,x be the push-forward of the prob- 
ability measure Xxemx under the mapping z ^ z (that is, the distribution of z when z has 
distribution Xxem x^- Then for suitable constants C = Cg < oo and /3 G (0, 1) not depending on 
T, m, or X, 



(91) 



dX^T,e,m,x ^ 



dvT,i 



and consequently, 

(92) dc{\^T,e,m,x, ^T,e,m,x) < 1 " (1 " C D'^ ■ 

Proof. The first inequality is a direct consequence of ((90)) . The second follows from the first 
by the following elementary fact: if v, p. are mutually absolutely continuous probability 
measures whose Radon-Nikodym derivatives du / dp and dp / dv are both bounded below 
by ^> G (0,1], then their coupling distance is no greater than \ — q. □ 

Since the mapping z ^ z moves each z by a distance at most 2~^, coupling distance 
between the probability measures \^T,e,m,x arid \**T,e,m,x is at most 2^*". By Corollary 19.81 
the coupling distance between AV,e,m,x and VT,£,m,x is at most C (3^ for a suitable C = 
< oo. Therefore, to prove Proposition |9]6] it suffices to prove the following lemma. 

Lemma 9.9. For each e > there exists a constant < oo such that for all x G S, all m large 
enough that 5m < 5e, and all large T, 

(93) dciXT,e,m,xAT,e,m,x) < Ce^m- 
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Proof. Recall that without loss of generality we may assume (for T large) that the rep- 
resentative sequence x G S is an element of i?T+2<5„i,e-2(5m- It must then be the case, 
by inequality l|80)l , that every y £ BT,e,m,x must have period T{y) = t{x), and that 
Rriy) £ (<^m,e — ^m)- This in turn implies that every z in the cylinder 
must also satisfy t{z) = t{x) and Rt{z) € (0, e). Consequently the support sets of the 
measures >^T,e,m,x ^^i^ \T,e,m,x satisfy 

(94) ^T,.,m,x C AT,e,m,x. 

Next, suppose that z e AT,£,m,x is such that Rt{z) G ((^m, e — ^m)- Let z be the periodic 
sequence with period r(z) that agrees with z in coordinates j G [—m, t{z) + m]; then 
by the same argument as above, using inequality ((80)) , we have Rt{z) G (0, e), and so z G 
BT,£,m,x- By construction, z is in the cylinder S[_m,r(2)+m] (z), so it follows that z G ^ ^ ^ 
and z = z. This proves that 

(95) AT,e,mix) \ ^T,e,m,x ^ {z : R^z) (<5„, £ - 6^)}. 

These arguments also show that for every z G AT^e,m,x such that Rt{z) G {5m,, e — 5m), and 
for every z G . m,x' 



(96) 



< e^""^ - 1. 



Relations (HHl, (|95)l , and l|96)) imply that the ratio of the Radon-Nikodym derivatives 
and ((81]) differs from 1 by less than C5m except on the set 

AT,£,m,x '■= ^ AT,e,m,x ■ Rt{z) {5m,£- ^m)}- 

But the renewal theorem (cf. inequality (dD) implies that for some constant indepen- 
dent of m, for all large T, 

ll-eF{A*rf^e,m,x) . 
Tfi V — '-^e^m- 

li'-eF\J^T,e,m,x) 

It now follows by routine arguments that for a suitable constant C'l, the total variation 
distance between the measures A^^ ^ ^ and Ar,e,m,a; is boimded above by C'lbm for large 
m and large T. This implies ((93)) . 

□ 



9.4. Proof of Proposition I9.3[ Recall that any Holder continuous function on S is coho- 
mologous to a Holder continuous function that depends only on the forward coordinates. 
Let F be the height function of the suspension, and let be a function of the forward 
coordinates that is cohomologous to F . Then —QF and —6F^ have the same topological 
pressure (which by choice of 6 is 0), and the Gibbs states fi-er and fi-eF+ are identi- 
cal. Also, for any periodic sequence a; G S with period (say) n it must be the case that 
SnF{x) = SnF^{x). Since the assertion ((79]) involves only periodic sequences and mea- 
sures of events under /i_6) p, to prove ((79)) it will suffice to prove l(79)l with F replaced by 
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F+. Thus, the representation ((37)l for the Gibbs state H-eF+ can be used; in particular, since 

Pr(-eF+) = 0, 

-j ri^eF+ 

where v and h are the right and left eigenvectors of the Ruelle operator C = C-qf^- (For 
the remainder of the proof we will drop the subscripts onh,^, n, and C.) 

The measure ^ is shift-invariant, so the cylinder sets in ([79l l can be shifted by (t~^, and 
hence can be regarded as cylinder sets in the one-sided sequence space S+. Fix a periodic 
sequence x of period n > 2m + 1; then 



hd{{C*Y+'^u) 



Here we have used the fact that v is an eigenmeasure of the adjoint £* of the Ruelle 
operator with eigenvalue 1. Next we use the definition of the Ruelle operator ( ilOL ch.l 
sec. B) to write, for any z € S+, 

^[Q,n+2m]\^r^ ^ ' ^ [0.n + 2m] 

J/GS+:(t" + ™'J/=Z 

The indicator function in this expression guarantees that the only z G S+ for which there 
is a nonzero term in the sum are those sequences such that Zj = Xj for j G [0, 2m]. (Keep 
in mind that x is periodic with period n.) For each such z there is exactly one y G S"*" 
for which the summand is nonvanishing, to wit, the sequence {x\z)2m->r\ '■= ^oxi ■ ■ ■ X2mZ 
obtained by prefixing to z the first 2m + 1 letters of x. Consequently, 



K^[0,n+2m]{x)) = / exp{-e Sn+mF {{x\z)2ra+l)}h{{x\z)2m+l)) diy{z) 

= e-^^"+-^(-)^(S[o,2™](x))(l ± 

for suitable constants C < oo and < /3 < 1 independent of x. The final approximate 
equality follows from the Holder continuity of F and h, together with inequality l|80l) . It 
now follows that if x, x' are any periodic sequences with periods n = T{a"^x) > 2m + 1 
and n' = T{a'^x') > 2m + 1 such that X[o,2m] = ^^'o 2m] then 

/^(S[0,n'+2m](x')) e-^Wi^C-')^ ^ ^ 
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for suitable C ,C" < oo. This implies relation ((79)). 

□ 

10. Proof OF Theorem O] 

The results of section |3] imply that for any compact surface T equipped with a smooth 
Riemannian metric of negative curvature the geodesic flow on ST is semi-conjugate (by 
a Holder continuous mapping vr : Si? — ST) to a suspension flow {'EF,4't) over a shift 
of finite type. All but finitely many closed geodesies correspond uniquely to periodic or- 
bits of this suspension flow, and for each of these the self-intersection count is given by 
equation ((26)l , or by equation l|26ll with h replaced by /i**, for some small r > 0. By Propo- 
sition lOl there exist values of r such that the function satisfies the hypotheses of Theo- 
rem [53] relative to any Gibbs state; for simplicity we will assume that the Poincare section 
of the suspension has been adjusted so that r = 0. If the Riemannian metric on T has 
constant curvature then the normalized Liouville measure for the geodesic flow coincides 
with the maximum entropy invariant measure, and so in this case Lemma lTH implies that 
the Hoeffding projection of h relative to the Gibbs state A = fi-gp is a scalar multiple of F. 
Therefore, Theorem 11.31 will follow from Theorem 19.21 provided that Hypothesis 19 . 1 1 can 
be verified. This we will accomplish by reducing the problem to a problem about crossing 
rates. 

The following lemma asserts that for compact surfaces of constant negative curvature, 
the ergodic law (O for intersections with a fixed geodesic segment extends from random 
geodesies to closed geodesies. Denote by Xt,£ the uniform distribution on the set of all 
(prime) closed geodesies with length in [T,T + e], and let = l/47r|T|. For any geodesic 
arc a, let |q!| be the length of a. 

Lemma 10.1. Assume that T is a compact surface with a Riemannian metric of constant negative 
curvature —1. For any geodesic segment a and any closed geodesic (3 let N{a; j3) be the number 
of transversal intersections of (3 with a. Then for every geodesic segment a, all sufficiently small 
e > 0, and all 5 > 0, 

(97) lim XTe{P ■■ \N{a;^) -Kr\f3\\a\\> 6T} = 0. 

T—^oo ' 

Proof. This follows from Theorem 7 of [211 by the same argument used to prove the er- 
godic theorem for self-intersections (Theorem 1 of ||2T]| ). The value of the constant kt 
follows from Lemma l2!2l since in constant curvature the Liouville measure and the maxi- 
mum entropy measure coincide. □ 

Hypothesis 19.11 concerns the quantity A™C/(x) defined by equation ((74)) . For any peri- 
odic sequence x with minimum period r(x) = tt{x) and any integer m this quantity is the 
maximum difference in self-intersection count between (a) the closed geodesic Gx corre- 
sponding to the periodic orbit of the suspension flow through (x, 0) and (b) any geodesic 
segment Gy = {tt o 4>t{y,0))o<t<Sr(x)F{y) where y is some sequence that agrees with x in 
coordinates —m < i < t{x) + m.\im is large, any two such geodesic segments are close. 
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because the semi-conjugacy between the suspension flow and the geodesic flow is Holder 
continuous. This can be quantified as follows. 

Lemma 10.2. There exists yl > such that for all n > 1, all sufficiently large m, and all -pairs 
x,y £ Ti such that Xi = Uifor —m < i < n + m, 

(98) d(7r(</)t(x,O)),7r(0t(y,O))) < e"^'" for all 0<t< tt{x). 

Proof By definition of the "taxicab" metric on Si? (cf. [12]), the orbits (/>t(x, 0) and (ptiy, 0) 
must remain within distance e~^"^, for a suitable constant B > 0. Because the semi- 
conjugacy vr is Holder continuous, the projections tt o (^^(x, 0) and vr o (^^(y, 0) must remain 
within distance e^^"^. □ 

Remark 10.3. The lengths of the segments Gx and Gy will in general be different, because 
need not equal ^^(^^^(x). However, the difference in lengths can be at most 6m, 
where dm is given by ((80)l , which decays exponentially in m. 

Proof of Hypothesis 19. 1 1 Fix geodesic segments Gx, Gy as above, and consider the difference 
in their self -inter section counts. To estimate this, consider how the continuous changes as 
Gx is smoothly deformed to Gy though geodesic segments by smoothly moving the initial 
and final endpoints, respectively, along smooth curves Cq and Ci . In such a homotopy, 
the self-intersection count will change only at intermediate geodesic segments Gz along 
the homotopy where one of the endpoints passes through an interior point of the segment. 
Now the geodesic segment Gz remains within distance e~^"^ of Gx, by Lemma [10.21 and 
Remark 110.31 so for any interior point of Gz that meets (say) the initial endpoint of Gz, 
the corresponding point on Gx must be within distance e^^*", and hence within distance 
g-Am q£ |.]^g curve Go- In particular, this corresponding point on Gx must fall inside a 
small rectangle Rq surrounding Gq whose sides are geodesic arcs. Since the lengths of Gq 
and Gi are bounded above by e~^"^ + 6m (by Lemma [l0.2l and Remark ll0.3|l the rectangle 
Rq can be chosen so that its sides all have lengths bounded by e~^'™. 

This proves that the difference in the self-intersection counts of Gx and Gy is bounded 
above by the number of crossings of dRo and dRi, where Ri are rectangles bounded by 
geodesic arcs of length < e^^ ™. Hence, Lemma [lO.li for most periodic sequences x of 
minimal periods T this difference is bounded above by (8 + e)e"'^''"T when T is large. 
This implies Hypothesis 19.11 □ 
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